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(57) Abstract 



Thislnventiorris directed to^tipiTlibrury inimunizationrwhich-provides met^^ 
for therapeutic and other uses. The methods are useful for obtaining improved antigens that can induce an immune response against 
pathogens, cancer, and other conditions, as well as antigens that are effective in modulating allergy, inflammatory and autoimmune diseases. 
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ANTIGEN LIBR.ARY IMMUNIZATION 



BACKGROUND OF THE INVENTION 

Field of the Invention 

This invention pertains to the field or" methods for developing irnmunogens 
that can induce efilcient immune responses against a broad range of antigens. 

Background 

The interactions bet\veen pathogens and hosts are results of millions of years 
of evolution, during which the mammalian immune system has evolved sophisticated means 
to counterattack pathogen invasions. However, bactenal and viral. pathogens have 
simultaneously gained a number of mechanisms to improve their virulence and survivai in 
hosts, providing a major challenge for vaccine research and development despite the powers 
of modem techniques of molecular and cellular biology. Similar to the evolution of pathogen 
antigens, several cancer antigens are likely to have gained means to downregulate their 
immunogenicity as a mechanism to escape the host immune system. 

Efficient vaccine development is also hampered by the antigenic 
heterogeneity of different strains of pathogeris/driveTi irf part by evolutionary forces as *" " 
means^for the pathogens to escape immune defenses. Pathogens also reduce their 
immunogenicity by selecting antigens that are difficult to express, process and/or transport in 
host cells, thereby reducing the availability of immunogenic peptides to the molecules 
initiating and modulating immune responses. The mechanisms associated with these 
challenges are complex, multivariate and rather poorly characterized. Accordingly, a need 
exists for vaccines that can induce a protective immune response against bacterial and viral 
pathogens. The present invention fulfills this and other needs. 
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SUMMARY OF THE INVENTION 

The present in\'ennon provides recombinant multivalent antigenic 
polypeptides that include a first antigenic determinant from a rlrst disease-associated 
polypeptide and at least a second antigenic determinant from a second disease-associated 
polypeptide. The disease-associaied polvpeptidescan be selected from' the group consistins 
of cancer antigens, antigens associated with autoimmunity disorders, antigens associated 
with inflammatop/ conditions, antigens associated with allergic reactions, antisens 
associated with infectious agents, and other antigens that are associated with a disease 
condition. 

In another embodiment, the invention provides a recombinant antigen librar>' 
that contains recombinant nucleic acids that encode antigenic polypeptides. The libranes are 
tvpically obtained by recombimng at least first and second fonns of a nucleic acid which 
includes a polynucleotide sequence that encodes a disease-associated antigenic poi>peptide,. 
wherein the first and second lonv.s ditTer from each other in two or more nucleotides, to . 
produce a library of recombinant nucleic acids. 

Another embodiment of the invention provides methods of obtaining a 
polynucleotide that encodes a recombinant antigen having improved ability to induce an 
immune response to a disease condition. These methods involve: (J) recombining at least 
first and second forms of a nucleic acid which composes a polynucleotide sequence that 
encodes an- antigenic polypeptide that is associated with the disease- condition, wherein the 
first and second forms differ from each other in two or more nucleotides, to produce a library 
of recombinant nucleic acids; and (2) screening the library to identify at least one optimized 
recombinant nucleic acid that encodes an optimized recombinant antigenic polypeptide that 
has improved ability to induce an immune response to the disease condition. 

These methods optionally further involve; (3) recombining at least one 
optimized recombinant nucleic acid with a further' form* of the nucleic acid, which is the. 
same or different from the first and second forms, to produce a further library of recombinant 
nucleic acids; (4) screening the further library to identify at least one further optimized 
recombinant nucleic acid that encodes a polypeptide that has improved ability to induce an 
immune response to the disease condition; and (5) repeating (3) and (4), as necessar>', until 
the further optimized recombinant nucleic acid encodes a polypeptide that has improved 
ability to induce an immune response to the disease condition. 
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^ 3 

In some embodiments, the optimized recombinant nucleic acid encodes a 
multivalent antigenic polypeptide and the screening is accomplished by expressing the 
library of recombinant nucleic acids in a phage display expression vector such that the 
recombinant antigen is expressed as a fusion protein with a phage polypeptide that is 
5 \ displa yed on a pha.ge particle surface; contacting the phage with a first antibcdy that is . 
specific for a first serotype of the pathogenic agent and selecting those phage that bind to th 
first antibody; and contacting those phage that bind to the first antibody with a second 
antibody that is specific for a second serotype of the pathogenic agent and selecting those 
phage that bind to the second antibody; wherein those phage that bind to the first antibody 
1 0 and the second antibody express a multivalent antigenic polypeptide. 

The invention also provides methods of obtaining a recombinant viral vector 
which has an enhanced ability to induce an antiviral response in a cell. These methods can 
include the steps of: ( 1 ) recombining at least first and second forms of a nucleic acid which 
comprise a viral vector, wherein the first and second forms differ from each other in mo or 
1 5 more nucleotides, to produce a library of recombinant viral vectors; (2) transfecting the ■ 
library of recombinant viral vectors into a population of mammalian cells; (3) staining the 
cells for the presence of Mx protein; and (4) isolating recom.binant viral vectors from ceils 
which stain positive for Mx protein, wherein recombinant viral vectors from positive 
staining cells exhibit enhanced ability to induce an antiviral response. 

20 BRIEF DESCRIPTION OF THE FIGURES 

Figure 1 shows a schematic representation of a method for generating a 
- -chimeric,-muitivalent antigen that has immunogenic regions from multiple- antigens. 

Antibodies to each of the non-chimeric parental immunogenic polypeptides are specific for 
the respective organisms (A, B, C). After carrying out the recombination and selection 
25 methods of the invention, however, a chimeric immunogenic polypeptide is obtained that is 
, . recognized by antibodies raised against each of the three parental irnmunogenic 
polypeptides. 

Figure 2 shows the principle of family DNA shuffling. A, family of antigen 
genes from related pathogens are subjected to shuffling, which results in a library of 
30 chimeric and/or mutated antigens. Screening methods are employed to identify those 
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recombinant antigens that are the most immunogenic and/or cross-protective. These caii,, if 
desired, be subjected to additional rounds of shuffling and screening. ■ 

Figure 3 A-Figure 3B shows a schematic for a method by which one can 
obtain recombinant polypeptides that can induce a broad-spectrum immune response. In 
5 Figure 3 A, wild-type immunogenic polypeptides from the pathogens A, B, and C provide ' . 
protection against the corresponding pathogen from which the polypeptide is derived, but- 
little or no cross-protection against the other pathogens (left panel). After shuffling, an 
A/B/C chimeric polypeptide is obtained that can induce a protective immune response 
...... against all three pathogen types (right panel). In Figure 3B, shuffling is used with substrate 

10 nucleic acids from two pathogen strains (A, B), which encode polypeptides that are 

. protective only against the corresponding pathogen., After shuffling, the resulting chimeric 
polypeptide can induce an immune response that is effective against not only the two 
parental pathogen strains, but also against a third strain of pathogen (C). 

Figure 4 diagrams somepf the possible factors that can determine whether a 
15 particular polynucleotide encodes an immunogenic polypeptide having a desired property, 
such as enhanced immunogenicity and/or cross-reactivity. Those sequence regions that 
. positively affect a particular property are indicated as plus signs along the antigen gene, 
while those sequence regions that have a negative effect are shown as minus signs.^A pool of 
related antigen genes are shuffled and screened, to obtain those that recombinant. nucleic 
20 acids that have gained positive sequence regions and lost negative regions. No pre-existing. - 
knowledge as to which regions are positive or negative for a- particular trait is required. 

Figure 5 shows a schematic representation of the screening strategy for 

antigen library screening. 

Figure 6 shows a schematic representation of a strategy for pooling and . 

25 deconvolution as used in antigen library screening. , • ' 

Figure 7 is an alignment of the nucleotide sequences of glycoprotein D (gD) 
fromHSV-1 (SEQID NO: 1) and HSV-2 (gD-l (SEQ ID NO: 2) and gD-2 (SEQ ID NO: 
3)). 

Figure 8A shows a diagram of a method for expressing HIV gpl20 using 
30 genetic vaccine vectors and generation of a library of shuffled gpl20 genes. Figure 8B 
shows PGR primers that are useful for obtaining gpl20 nucleic acid substrates for DNA 
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shuffling reactions. Pnmers suitable for generating substrates include 6025F (SEQ ID NO: 
4), 7773R (SEQ ID NO: 5), and primers suitable for amplifying the shuffled nucleic acids 
include 61 96F (SEQ ID NO:>6) and 7746R (SEQ ID NO; 7), The primer BssH2-6205F (SEQ 
ID NO: 8) can be used to clone the resulting fragment into a genetic vaccine vector. 

, — _5 . ^. ^Figure-9-sho\v's the domain-structure-of_hepadnavirus..envelope^genes ^ 

Figure 1 0 shows a schematic representation of the use of shuffling to obtain 
hepadnavirus proteins in which the immunogenicity of one antigenic domain is improved. 

Figure 1 1 shows a strategy in which genes that encode the hepadnavirus 
proteins having one antigenic domain that has improved immunogenicity are shuffled to 
10 obtain recombinant proteins in which all three domains have improved immunogenicity. 

Figure 12 shows the transmembrane organization of the HBsAg pol\7)eptide. 
Figure 13 shows a method for using phage display to obtain recombinant 
allergens that are not bound by pre-existing IgE. 

Figure 14 shows a strategy for screening of recombinant allergens to identify 
15 those that are effective in activating Th cells. PBMC or T cell clones from atopic individuals 
are exposed to antigen-presenting cells that display the antigen variants obtained using the 
methods of the invention. To identify those allergen variants that are effective in activating T 
cells, the cultures are tested for induction of T cell proliferation or for a pattern of cytokine 
synthesis that is indicative of the particular type of T cell activation that is desired. If 
20 desired, the allergen variants that test positive in the in vitro, assay can be subjected to in vivo 
testing. 

Figure 15 shows a strategy for screening of recombinant cancer antigens to 
identify those that are effective in activating T cells of cancer patients. 

Figure 16A and Figure 16B show two different strategies for generating 
25 vectors that contain multiple T cell epitopes obtained, for example, by DNA shuffling. In 
Figure 16A, each individual shuffled epitope-encoding nucleic acid is linked to a single 
promoter, and multiple promoter-epitope gene constructs can be placed in a single vector. 
The scheme shown in Figure ISB involves linking muhiple epitope-encoding nucleic acids 
to a single promoter. 

30 Figure 1 7 shows the sequences of PreS2-S coding regions and corresponding 

amino acid sequences of different hepatitis B surface antigen (HBsAg) of woodchuck 
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hepatitis B (.\VliV) proteinsr Primers suitable for amplinc ' 

' shown. ; ' . ■ ■ " 

• Figure 18 shows primers that are suitable for arnplification of large frag^ 
that contain the S2S coding sequences. The primers hvbridize to regions that are ' 
5 approximately 200 bp outside the desired sequences. • 

Figure 19 shows an alignment of the amino acid sequences of surface. 
' antigens from different HVB and WHV subtypes. 

. . • Figure '20 shows a diagram, of multinheric panicles that assemble when an 
appropriate number of chimeric polypeptides and native HBsAg S monomers are mixed. ■ 

10 DETAILED DESCRIPTION 

Pefinidons 

The term "screening" descnbes. m general, a process that identifies optimal 
- ^ antigens. Several properties of the 'antigen can be used-ih selection arid screening including 
antigen expression, folding, stability, immunogenicity and, presence of epitopes from. several 
■ 15' related antigens. Selection is a form of screenmgin which identification and physical 

* separation are achieved simultaneously by expression of a selection marker, av hi ch, in some 
' genetic circumstances, allows cells expressing the marker to survive while other cells die (or 
vice versa). Screening markers include, for example,Tuciferase,.beta-gaiactosidase and 
green. fluorescent protein. ".Selection markers include drug and"toxin resistance genes, and 
' 20 the like,' Because of limitations in studying primary immune responses in vitro, in vivo 
studies are particularly useful screening methods. In these studies,, the antigens are first 
introduced to test animals, and the inimune responses. are subsequently studied by analyzing 
protective immune responses or by studying the quality or strength of the induced immune 
' . response using lyniphoid.cells derived' from the immunized animal Although spontaneous, 
25 selection can and does occur in the course of natural evolution, in the present methods 
selection is perfprrned by man. 

A ^'exogenous DNA segment", ''heterologous sequence" or a '^heterologous 
nucleic acid"; as used herein, is one that originates from a source foreign to the particular 
host cell, or, if from the same source, is modified from its original form. Thus, a ' 
30 heterologous gene in a host cell includes a gene that is endogenous to the particular host cell, 
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burhas been modified. Modification of a heterologous sequence in the applications 
described herein t\pically occurs through the use of DNA shuffling. Thus, the terms refer to 
a DNA segment which is foreign or heterologous to-the celL or homologous to the ceil but ii* 
a position within the host cell nucleic acid in which the element is riot ordinarily found. , 

—Exogenous-DNA.segments-are^e.xpressedjo3lieId exogenous polypep 

The term "gene" is used broadly to refer to any segment of DNA associated 
with a biological function. Thus, genes include coding sequences and/or the regulatory 
sequences required for their expression. Genes also include nonexpressed DNA segments 
that, for example, form recognition sequences for other proteins. Genes can be obtained from 
a vanety of sources, including cloning from a source of interest or synthesizing from teown 
or predicted sequence information, and may include sequences designed to have desired 
parameters. 

The term ''isolated", when applied to a nucleic acid or protein, denotes that 
the nucleic acid or protein is essentially free of other cellular components with which it is 
associated in the natural state. It is preferably in a homogeneous state although it can be in 
either a dry or aqueous solution. Purity and homogeneity are typically determined using 
analytical chemistry techniques such as polyacrylamide gel electrophoresis or high 
performance liquid chromatography, A protein which is the predominant species present in a 
preparation is substantially purified. In particular, an isolated gene is separated from open 
reading frames which flank the gene and encode a protein other than the gene of interest. 
The term "purified'* denotes that a nucleic acid or protein gives rise to essentially one band 
in an electrophoretic gel. Particularly, it means that the nucleic acid or protein is at least 
about 50% pure, more preferably at least about 85% pure, and most preferably at least about 
99% pure. 

The term "naturally-occurring" is used to describe an object thai can be found 
in nature as distinct from being artificially produced. by man. For example, a polypeptide or 
polynucleotide sequence that is present in an. organism (including. viruses, bacteria, protozoa, 
insects, plants or mammalian tissue) that can be isolated from a source in nature and which 
has not been intentionally modified by man in the laboratory is naturally-occurring. 

The term "nucleic acid" refers to deoxyribonucleotides or ribonucleotides and 
polymers thereof in either single- or double-stranded form. Unless specifically limited, the 
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tenn encompasses nucleic acids containing ioiown analo • ' 

have similar binding properties as the reference nucleic acid and, are metabolized in a manner : 
similar to naturally occurring nucleotides. Unless.otherwise indicated, a particular nucleic • . , 
acid sequence also implicitly encompasses conservatively modified variants thereof (e^. 
5 degenerate codon substitutions) and complementary sequences and as well as the sequence 
explicitly indicated. Specifically, degenerate codon substitutions may be achieved .by 
generating sequences, in which the third position of one or more selected (or all) codons is' . , 
substituted, with mixed-base and/or deoxyinosine residues (Batzer et all (1991) Nucleic Acid , 
. -Res. 19: 5081; Ohtsuka et a/! (1985)/ BioL Chem. 260;- 2605-2608;^ Casspl et aL ( 1 992), :.. ; 
10 Rossolini a/. (1994) A/a/. Ce//. Proiej 5: 91-98). The term nucl^ 
, interchangeably with gene, cDNA, and mRNA encoded by a gene: 

^ "Nucleic acid derived fi-om a'gene'' refers to a nucleic acid for whose ^ - ' ' 

synthesis the gene, or a subsequence thereof, has ultimately served as a template. Thus, an 
, mRNA, a cDNA reverse transcribed fi-om an mRNA, an RNA transcribed from that cDNA, a - 
J 5 DNA amplified from the cDNA, an RNA transcribed from the amplified DNA, e^^^ 

derived fi-om the gene and detection of such derived products is indicative of the presence 
and/or abundance of the original gene and/or gene transcript in a sample, .. 

. ^ -A-nucleic acidas ''operably linked'* when it is placed into, a fiinctional . ' 

relationship with another nucleic acid sequence. For instance, a promoter or enhancer is , 
20 operably linked to a coding sequence if it increases' the transcription, of the coding sequence. 
/ ' Operably Hnked means that the DNA sequences being linked are typically 

■ 'where necessary to join two protein coding regions, contiguous and in reading firame, . . . ■ 

However, since enhancers generally, function when separated fi-om the promoter by several 
kilobases and intronic sequences rnay be of variable -lengths, some polynucleotide elements , - 
25 ' may be operably linked but not contiguous. 
: ' A specific binding affinity between two molecules, for example, a ligand and 

a receptor, means a preferential binding of one molecule for another, in a mixture of 
molecules. The binding of the molecules can be considered specific if the binding affinity is . 
about 1 X 10"^ M to about I x 10^ M *^ or greater. 
. 30 . The term ''recombinant" when used with reference to a cell indicates that the 

cell replicates a heterologous nucleic acid, or expresses a peptide or protein encoded by a 
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heteroloeous nucleic acid. Recombinant cells can contain genes that are not found within 
the native (non-recombinant) form of the cell/Recombinant cells, can also contain genes 
found in the native form of the cell wherein the genes are modified and re-introduced into 
the cell by artificial means. The term also encompasses cells that contain a nucleic acid 

5^_endo.geno.usjojhe^celi_that has been modified Avithout removin g the nucleic^cid_fiY}mjj^ 

cell; such modifications include those obtained by gene replacement, site-specific mutation, 
and related techniques. 

A ^'recombinant expression cassette'' or simply an ''expression cassette" is a 
nucleic acid construct, generated recombinantly or synthetically, with nucleic acid elements 
10 that are capable of effecting expression of a structural gene in hosts compatible with such 
sequences. Expression cassettes include at least promoters and optionally, transcription 
tenmination signals. Tvpically, the recombinant expression cassette includes a nucleic acid 
to be transcribed (e.g., a nucleic acid encoding a desired polypeptide), and a promoter. 
Additional factors necessary or helpful in effecting expression may also be used as described 
15 herein. For example, an expression cassette can also include nucleotide sequences that 
encode a signal sequence that directs secretion of an expressed protein from the host cell. 
Transcription termination signals, enhancers, and other nucleic acid sequences that influence 
gene expression, can also be included in an expression casset&. 

A **multivalent antigenic polypeptide" or a ''recombinant nriultivalent 
20 antigenic polypeptide" is a non-naturally occurring polypeptide that includes amino acid 
sequences from more than one source polypeptide, which source poK^peptide is typically a 
naturally occurring poKpeplide. At least some of the regions of different amino acid 
sequences constitute epitopes that are recognized by antibodies found in a mammal that has 
been injected with the source polypeptide. The source polypeptides from which the different 
25 epitopes are derived are usually homologous (/.e., have the same or a similar structiu-e and/or ' 
function), and are of^en from different isolates, serotypes, strains, species, of organism or 
from different disease states, for example. 

The terms "identical" or percent "identity," in the context of two or more 
nucleic acid or polypeptide sequences, refer to two or more sequences or subsequences that 
30 are the same or have a specified percentage of amino acid residues or nucleotides that are the 
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same, when compared and aligned for maximum correspondence, as measured using one of 
' the following sequence cornparison algorithms or by visual inspection. 

The phrase ''substantially identical," in the context of nvo nucleic acids or 
polypeptides, refers to two or more' sequences or subsequences that have. at least 60%, 
5 preferably 80%, most preferably 90-95%) nucleotide or, amino acid residue identity, when 
compared and, aligned for maximum correspondence, as measured using. one of the following 
sequence comparison algorithms or by visual inspection. Preferably, the substantial identity 
exists over a region of the sequences that is at least about 50 residues in length, more 
preferably over a region of at least about 100 residues, and most preferably the sequences are 
10 substantially identical over at least about 150. residues, hi some embodiments, the sequences 
are substantially identical over the entire length of the coding regions. 
. . For sequence comparison, typically one sequence acts as a reference sequence 

to which test sequences are compared. WTien using a sequence comparison algorithm, test 
and reference sequences are input into a computer, subsequence coordinates are designated, 
15 ■ if necessary, and sequence algorithm'pro gram parameters, are designated. The sequence 
comparison algorithm then calculates the percent sequence identity for the test sequence(s) 
, relative to the reference sequence, based on the designated program parameters. 

Optimal alignment of sequences for comparison can be conducted, e.g.\ by 
the local homology algorithm of Smith & Waterman, Adv. Appi Math. 2:482 (1981), by the 
20 homology alignment algorithm of Needleman & Wunsch, J, Moi BipL 48:443 (1970), by the. 
search for similarity method of Pearson & Lipman, f roc. Nat 7. Acad, Sci. USA 85:2444 
(1988), by computerized implementations of these algorithms (GAP, BESTFIT, FASTA, 
and TFASTA in the Wisconsin Genetics Software Package, Genetics Computer Group, 5.75 
Science Dr., Madison, \\T), or by visual inspection {see generally Ausubel et aL, infra). 
25 . One example of an algorithm that is suitable for determining percent 

. sequence identity and sequence similarity is the BLAST algorithm, which is described in 
Altschul etaL, J. Mol. BioL 215:403-410 (1990). Software for performing BLAST analyses 
is publicly available through the National Center for Biotechnology Information 
(http://www.ncbi.nhn.nih.gov/). This algorithm mvolves first identifying high scoring 
30 , sequence pairs (HSPs) by identifying short words of length W in the query sequence, which 
either match or satisfy some positive-valued threshold score T when aligned with a word of 
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the same length in a database sequence. T is referred to as the neighborhood word score 
threshold (Altschul et al., supra). These initial neighborhood word hits act as seeds for - 
initiating searches to find longer HSPs containing them. The word hits are then extended in 
both directions along each sequence for as far as the cumulative alignment score can be 
"^irfcrersecir^Cmmilative^cFre^ 
M (reward score for a pair of matching residues; always > 0) and N (penalty score for 
mismatching residues; always < 0). For amino acid sequences, a scoring matrix is used to 
calculate the cumulative score. Extension of the word hits in each direction are halted when; 
the cumulative alignment score falls off by the quantity X from its maximum achieved 
value; the cumulative score goes to zero or below, due to the accumulation of one or more 
negative-sconng residue alignments; or the end of either sequence is reached. The BLAST ■ 
aigonthm parameters W, T, and X detemiine the sensitivity and speed of the alignment. The 
BLASTN program (for nucleotide sequences) uses as defaults a wordlength (W) of 11, an 
expectation (E) of lO; a cutoff of 100, M=5, N=-4, and a comparison of both strands. For 
ammo acid sequences, the BLASTP program uses as defaults a wordlength (\V)^of 3, an 
expectation (E) of 10, and the BLOSUM62 scoring matrix {see Henikoff cS: Henikoff (1989) 
Proc. Natl. Acad ScL USA 89:10915). 

In addition to calculating percent sequence identity, the BLAST algorithm 
also perfomis a statistical analysis of the similarity between Uvo sequences (^ee, e.g., Karhn 
Sc Altschul (1993) Proc: Nat 'i Acad, ScL USA 90:5873-5787), One measure of similarity 
provided by the BLAST algorithm is the smallest sum probability (P(N)), which provides an 
indication of the probability by which a match between tvvo nucleotide or amino acid 
sequences would oceur by chance. For example, a nucleic acid is considered similar to a 
reference sequence if the smallest sum probability in a comparison of the test nucleic acid to 
the reference nucleic acid is less than about 0.1, more preferably less than about 0.01, and 
most preferably less than about 0,001. , . . 

Another indication that two nucleic acid sequences are substantially identical 
is that the two molecules hybridize to each other under stringent conditions. The phrase 
"hybridizing specifically to", refers to the binding,- duplexing, or hybridizing of a molecule 
only to a particular nucleotide sequence under stringent conditions when that sequence is 
present in a complex mixture (e.g., total cellular) DNA or RNA. "Bind(s) substantially" 
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refers 10 complementary hybridization beUveen a probe nucleic acid, and a target nucleic acid 

and embraces minor mismatches that can be accommodated by reducing thC'Stringency of . ■ 

the hybridization media to achieve the desired detection of the target polynucleotide 
sequence. , ■ ■ • 

5 , '^Stringent hybridization conditions'' and ''stringent hybridization wash. 

conditions" in the context of nucleic acid hybridization experiments such as Southern and 
•northern hybridizations are sequence dependent, and are different under different 
environmental parameters. Longer sequences hybridize specifically at highertemperatures. 
;-.v- An extensive guide to the hybridization of nucleic acids is found in Tijssen (1993) 

10 Laboratory Techniques in Biochemistry and Molecular Biology-Hybridization with Nucleic ' 
Acid Probes p^xx I chapter 2 "Overview of pnncipies of hybridization and:the strategy of 
nucleic acid probe assays", Elsevier, New York. Generally, highly stnngent hybndization 
and wash conditions. are selected to be about 5° G lower than the thermal melting point (TJ 
for the specific sequence at a defined ionic strength and pH. Typically, under "stringent 

15 conditions" a probe will hybridize to its target subsequence,, but to no other sequences. 

' . The Tm is the temperature (under defined ionic strength and pH) at which 

50% of the target sequence hybridizes, to a perfectly matched probe. Very stringent 
' conditions are selected to be equal to the Tm for a.particular probe. An example of stringent 
hybridization conditions for hybridization of complementary nucleic acids which have more 

20 than 1 00 complementary residues on a filter in a Southern or northern blot is 50% 

formaniide with 1 . mg of heparin at 42^*0, with the hybridization being carried out overnight. 
An example of highly stringent wash conditions is 0. 1 5M NaCl at 72°C for about 15 
minutes. An example of stringent wash conditions is a 0,2x SSC wash at 65°C for 15 
minutes (5^e, Sambrook, /n/ra., for a description of SSC buffer). Often, a high stringency 

25 wash is preceded by a low stringency wash to remove background probe signal. An example ; 
medium stringency wash for a duplex of, e.g., more than 100 nucleotides, is Ix SSC at 45°G 
for 15 minutes. An example.low stringency wash for a duplex of, e,g., more than 100 
nucleotides,. is 4-6x SSC at 40^0 for 15 minutes,.. For short probes (e.g., about-10 to 50 
nucleotides), stringent conditions typically involve salt concentrations of less than about 1.0 

30 M Na" ion, typically about 0.01 to 1.0 M Na"" ion- concentration (or other salts) atpH 7.0 to 
8.3, and the temperature is typically at least about 30*^0. Stringent conditions can also be 
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achieved with the addition of destabilizing agents such as formamide. In general, a sienal to 
noise ratio of 2x (or higher) than that observed for an unrelated probe in the particular 
hybridization assay indicates detection of a specific hybridization. Nucleic acids which do 
not hybridize to each other under stringent conditions are still substantially identical if the 
pdlypepfideYwhichlhey encodeafesiibstantial lyidenticarTlris occurs, e.g. , wherTac^opy o"f 
a nucleic acid is created using the maximum codon degeneracy permitted by the genetic 
code. 

A further indication that rwo nucleic acid sequences or polypeptides are 
substantially identical is that the polypeptide encoded by the first nucleic acid is 
immunologically cross reactive with, or specifically binds to, the polypeptide encoded by the 
second nucleic acid. Thus, a pol>T3eptide is typically substantially identical to a second 
polypeptide, for example, where the two peptides differ only by conservative substitutions. 

The phrase 'Xoecifically (or selectively) binds to an antibody*' or ''specifically 
(or selectively) immunoreactive with", when refemng to a protein or peptide, refers to a 
binding reaction which is determinative of the presence of the protein, or an epitope from the 
protein, in the presence of a heteroge-^.eous population of proteins and other biologies. Thusf 
under designated immunoassay conditions, the specified antibodies bind to a particular 
protein and do not bind in a significant amount to other proteins present in the sample. The 
antibodies raised against a multivalent antigenic polypeptide will generally bind to the ' 
proteins firom which one or more of the epitopes were obtained. Specific binding to an 
antibody under such conditions may require an .antibody that is selected for its specificity for 
,a particular protein. A variety of immunoassay formats. rnay be used to select antibodies . . 
specifically immunoreactive with a particular protein. For example, solid-phase ELISA 
immunoassays, Western blots, or immunohistochemistry are routinely used to select 
monoclonal antibodies specifically immunoreactive with a protein. See Harlow and Lane 
(1988) Antibodies, A Laboratory Manual, Cold Spring Harbor Publications, New York 
/*Harlow and Lane'')> for a description of immunoassay formats and conditions that can be 
used to determine specific immunoreactivity. Typically a specific or selective reaction will 
be at least twice background signal or noise and more typically more than 1 0 to 1 00 times 
background. ■ 
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''Conservatively modified variations*' of a particular polynucleotide sequence 
, refers to those polynucleotides thai encode identical or essentially identical amino acid 
sequences, or where the polynucleotide does not encode an amino acid sequence, to 
* essentially identical sequences. Because.qf the degeneracy of the genetic code, a large - 
5 number of functionally identical nucleic acids encode any given polypeptide. For instance, 
the codons CGU, CGC, CGA, CGG, AGA, and AGG all encode the amino acid arginine. 
Thus, at every position where an arginine is specified by a codon,. the codop can be altered to 
any of the corresponding codons described without altering the encoded polypeptide. Such 
' nucleic acid variations are ''silent variations,** which are one species of "conservatively 
10 modified variations." Every polynucleotide sequence described herein which encodes a 
polypeptide also describes everv' possible silent variation, except where otherwise noted. 
One of skill will recognize that each codon in a.nucleic acid (except AUG, which is 
ordinarily the only codon for methionine) can be modified to yield a functionally identical 
molecule by standard techniques. Accordingly, each "silent variation- ' of a nucleic acid 
15 which encodes a polypeptide is implicit in each described sequence. 

Furthermore, one of skill will recognize that individual substitutions, . 
deletions or additions which alter, add or delete a* single amino acid or a small percentage of 
amino acids (typically less than 5%, more typically less than. 1%) in an encoded sequence, are 
"conservatively modified variations'.' where the alterations result in the substitution of an 
20 amino acid with a chemically similar amino acid. Conservative substitution tables providing 
functionally similar amino acids are vyell known in the art. The following five groups each 
contain amino acids that are conservative substitutions for one another: ^ 

Aliphatic; Glycine (G), Alanine (A), Valine (V), Leucine (L), Isoleucine (I); 
Aromatic: Phenylalanine (F), Tyrosine (Y), Tryptophan (W); 
25 ' ^ Sulfur-containing: Methionine (Nl), Cysteine (C); 

Basic: Arginine (R), Lysine (K), Histidine (H); 

Acidic: Aspanic acid (D), Glutamic acid (E), Asparagine (N), Glutamine (Q). 
See also, Cz^ighion (1984) Proteins, W.H. Freeman and Company, for additional groupings 
of amino acids. In addition, individual substitutions, deletions or additions which alter, add 
30 or delete a single amino acid or a small percentage of amino acids in an encoded sequence 
are also "conservatively modified variations". 



SUBSTITUTE SHEET (RULE 26) 



wo 99/41383 



15 



PCT/US99/02944 



A ^^subsequence" refers to a sequence of nucleic acids or amino acids that 
comprise. a part of a longer sequence of nucleic acids or amino acids {e,g., polypeptide) 
respectively. 

Description of the Preferred_Embodiments _ " . 

The invention provides a new approach to vaccine development, which is 
tenmed ''antigen librarv' immunization/* No other technologies are available for generating 
libraries of related antigens or optimizing known protective antigens. The most powerful 
previously existing methods for identification of vaccine antigens, such as high throughput 
sequencing or expression library immunization, only explore the sequence space provided by 
the pathogen genome. These approaches are likely to be insufficient, because they generally 
only target single pathogen strains, and because natural evolution has directed pathogens \o 
downregulate their own immunogenicity. In contrast, the immunization protocols of the 
invention, which use shuffled antigen libraries, provide a means to identify novel antigen 
sequences. Those antigens that are most protective can be selected ft'om these pools by in 
vivo challenge models, .\ntigen library immunization dramatically expands the diversity of 
available imjnunogen sequences, and therefore, these antigen chimera libraries can also 
provide means to defend against newly emerging pathogen variants of the future. The 
methods of the invention enable the identification of individual chimeric antigens that 
provide efficient protection against a variety of existing pathogens, providing improved 

Jr. . 

vaccines for troops and civilian populations. , 

The methods of the invention provide an evolution-based approach, such as 
DNA shuffling in particular, that is an optimal approach to improve the immuribgenicity~df 
many types of antigens. For example, the methods provide means of obtaining optimized 
cancer antigens useful for preventing and treating malignant diseases. Furthermore, an , 
increasing number of self-antigens, causing autoimmune diseases, and allergens, causing 
atopy, allergy and asthma, have been characterized. The immunogenicity and manufacturing . 
of these antigens can likewise be improved with the methods of this, invention. 

The antigen library immunization methods of the invention provide a means 
by which one can obtain a recombinant antigen that has improved ability to induce an 
immune response to a pathogenic agent. A '^pathogenic agent" refers to an organism or virus 
that is capable~ofinfecting-a-hostxeUrPathogenic-agents-typically4nclude-and/or-e 
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molecule, usually a polypeptide, that is immunogenic in that an immune response is raised 
against the immunogenic polypeptide. Often, the immune response raised against an 
immunogenic polypeptide from one serot>TDe of the pathogenic agent is not capable of 
recognizing, and thus protecting against; a different serotype of the pathogenic agent, or 
5 other related pathogenic agents. In other situations, the polypeptide .produced by a . 

pathogenic agent is not produced in sufficient amounts, or is not sufficiently immunogenic, 
for the infected host to raise an effective immune response against the pathogenic agent. 

These problems are overcome by the methods of the invention, which 
typically involve recombinirig two or more forms of a nucleic acid that encode a polypeptide 
10 of the pathogenic agent,, or antigen involved in another disease or condition. These 

recombination methods, referred to herein as "DNA shuffling", use as substrates forms of 
the nucleic acid that differ from each other in two or more nucleotides, so a library of 
recombinant nucleic acids results. The library is then screened to identify at least one 
optimized recombinant nucleic acid that encodes an optimized recombinant antigen that has 
15 improved ability to induce an immune response to the pathogenic agent or other condition. 
The resulting recombinant antigens often are chimeric in that they are recognized by 
antibodies (Abs) reacting against multiple pathogen strains, and generally can also elicit , 
broad spectrum immune responses. Specific neutralizing antibodies are known to mediate 
protection against several pathogens of interest, although additional mechanisms, such as- 
20 cytotoxic T lymphocytes, are likely to be involved. The concept of chimeric, multivalent 
* antigens inducing broadly reacting antibody responses is ftirther illustrated in Figure 1. 

In preferred embodiments, the different forms of the nucleic acids that encode 
antigenic polypeptides are obtained from members of a family of related pathogenic agents. 
This scheme of performing DNA shuffling using nucleic acids from related organisms, 
25 knownas "family shuffling," is described in Grameri er a/. 

and is shown schematically in Figure 2. Polypeptides of different strains and'serotypes of 
pathogens generally vary between 60-98%, which will allow for efflcient family DNA 
shuffling. Therefore, family DNA shuffling provides an effective approach to generate 
multivalent, crossprotective antigens. The methods are usefiil for obtaining individual 
30 chimeras that effectively protect against most or all pathogen variants (Figure 3 A). 

Moreover, immunizations using entire Ubraries or pools of shuffled antigen chimeras can 
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also result in identification of chimeric antigens that protect against pathogen variants that 
were not included in the starting population of antigens (for example, protection against 
strain C by shuffled library of chimeras/mutants of strains A and B in Figure 3B). 
Accordingly, the antigen library immunization approach enables the development of 

~5~ ~immunogenic~po 

newly emerging pathogen variants. 

Sequence recombination can be achieved in many different formats and 
permutations of formats, as described in further detail below. These formats share some 
common principles. For example, the targets for modification vary in different applications, 

10 as does the property sought to be acquired or improved. Examples of candidate targets for 
acquisition of a propeny or improvement in a property include genes that encode proteins 
which have immunogenic and/'or toxigenic activity when introduced mto a host organisin. 

The methods use at least rwo variant forms of a starling target. The variar.i 
forms of candidate substrates can show substantial sequence or secondary structural 'p 

1 5 similarity with each other, but they should also differ in at least one and preferably at leasi 

two positions. The initial diversity between forms can be the result of natural variation, e,g,, t?.^ 
the different variant forms (homologs) are obtained from different individuals or strains of 
an organism, or constitute related sequences from the same organism {e.g., allelic i 
variations), or constitute homologs from different organisms (interspecific variants). 

20 Alternatively, initial diversity can be induced, e.g., the variant forms can be generated by 
error-prone transcription, such as an error-prone. PGR or use of a polymerase which lacks 
^proof-reading activity (^ee. Liao (1990) Gewe 88:107-1 11), of the first variant form, or, by 
replication of the first form in a mutator strain (mutator host cells are discussed in further 
detail below, and are generally well known). A mutator strain can include any mutants m any 

25 organism impaired in the functions of mismatch repair. These include mutant gene products 
of mutS, muiT, mutH, mutL, ovrD, dcm, vsr, umuC, umuD, sbcB, recJ, etc. The impairment 
is achieved by genetic mutation, allelic replacement, selective inhibition by ah added reagent 
such as a small compound or an expressed antisense RNA, or other techniques. Impairment- 
can be of the genes noted, or of homologous genes in any organism. Other methods of 

30 generating initial diversity include methods well known to those of skill in the art, including, 
for example, treatment of a nucleic acid with a chemical or other mutagen, through 
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spontaneous mutation, and by inducing an error-prone repair system (e.g., SOS) in, a cell that * 
contains the nucleic acid. The. initial, diversity betAveen substrates is greatly augmented in ■ 
' subsequent steps of recombination for library generation. 

Properties involved in immunogenicity. ' ' ^ 

5 The effectiveness of an antigen in inducing an immune response against a ■ 

. pathogen can depend upon several factors, many of which are'not well understood. Most 
' previously available methods for increasing the effectiveness of antigens are'dependent upon 
understanding the molecular basis for these factors. However, DNA shuffling and antigen 
" ' library inirnunization according to the methods of the invention are effective even where the 
10 molecular bases are unknovvTi. The methods of the invention do not rely upon a /jw^^^ 

assumptions. ' ' , ' ^ 

Polynucleotide sequences that can positively or negatively affect the 
/ ' immunogenicity of an antigen encoded by the polynucleotide are often scattered throughout ^ , ■ 

the entire antigen gene. Several of these factors are shown diagrainmatically in Figure 4. By 
15 ' recombining different forms of polynucleotide that encode 'the antigen using DNA shuffling, ■ 

' followed by 'selection for those chimeric polynucleotides that encode, an antigen that can . ^ 

induce, an improved immune response, one can obtain pnmarily sequences that have a • 
positive influence on antigen inimunbgenicity. Those sequences that negatively af^^ 
' antigen immunogenicity are eliminated (Figure 4). One' need not know the particular 
■ 20 ' .sequences involved. \ ' - . - , 

pNA Shuffling Methods 

\ ' . .Gensraliy, the methods of the invention entail performing DNA . ^ 

* recombination ("shuffling") and screening or selection to/'evolve" individual genes, whole 

. / plasmids or viruses, niuUigene clusters, or even whole ge^ 
25 / Bio/Techriology 13:549-553), Reiterative cycles of recombination and screening/selection • 
can be performed to flirther evolve the nucleic acids of interest. Such techniques do not 
require the extensive analysis and compulation required by conventional methods for 
polypeptide engineering. Shuffling allows the recombination of large numbers of mutations . 
in a minimum number of selection cycles, in contrast to natural pair-wise recombination 
30 events (e.g., as occur during sexual replication). Thus, the sequence recombination 
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techniqueS'described herein provide particular advantages in that they provide recombination 
between mutations in any or all of these, thereby providing a ver>' fast way of exploring the 
manner in which different combinations of mutations can affect a desired result. In some 
instances, however, structural and/'or functional information is available which, although not 

— 5— — required-for sequence recombinationrprovides-opportunities for^modification-of-the 

technique. 

The DNA shuffling methods of the invention can involve at least one of at 
least four different approaches to improve immunogenic activity as well as to broaden 
^ specificity. First, DNA shuffling can be performed on a single gene. Secondly, several 

10 highly Homologous genes can be identified by sequence comparison with known 

homologous genes. These genes can be synthesized and shuffled as a family of homologs, to 
select recombinants with the desired activity. The shuffled genes can be cloned into 
appropriate host cells, such as E. coli, yeast, plants, fungi, animal cells, and the like, and 
those that encode antigens having the desired properties can be identified by the methods 

15 described below. Third, whole genome shuffling can be performed to shuffle genes that 

encode antigenic polypeptides (along with other genomic nucleic acids). For whole genome 
shuffling approaches, it is not even necessary 10 identify which genes are being shuffled. 
Instead, e.g., bacterial cell or viral genomes are combined and shuffled to acquire 
recombinant polypeptides that have enhanced ability to induce an immune response, as 

20 measured in any of the assays descnbed below. Fourth, antigenic polypeptide-encoding 
genes can be codon modified to access mutational diversity not present in any naturally 
occurring gene. Details on each of these procedures can be found below. 

Exemplary formats and examples for sequence recombination, sometimes 
referred to as DNA shuffling, evolution, or molecular breeding, have been described by the 

25 present inventors and co-workers in co-pending applications U.S. Patent Application Serial 
No. 08/1 98,431, -filed February 17, 1994, Serial No. PCT/yS95/02126, filed, February 17, 
1995, Serial No. 08/425,684, filed April 18, 1995, Serial No. 08/537,874, filed October 30, 
1995, Serial No. 08/564,955, filed November 30, 1995, Serial No. 08/621,859, filed March 
25, 1996, Serial No. 08/621,430, filed March 25, 1996, Serial No. PCT/US96/05480, filed 

30 April 18, 1996, Serial No. 08/650,400, filed May 20, 1996, Serial No. 08/675,502, filed July 
3, 1996, Serial No. 08/721, 824, filed September 27, 1996, Serial No. PCT/US97/17300, 
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■filed September 26, 1997, and Serial No. PCTAJS97/24239; filed December 1.7, 1997; ' / 
Stemmer/5c/^«ce27O-151O{19?50; Stemniererfl/., Gene 164:49-53X1995); Steirimer,; ^ 

Bio/Technology 13:549-553 (1995); Stemmer, Proc, Natl/Acad, Sci. USA, RI-: 10747- 10751 \. 
' ' (1994); SteriimenyVa/wre 370:389-391 (1994); Cramen era^^ 
5 (1996); Crameri et ai. Nature Biotechnology 14:315-319(1996), each of which is , 
incorporated by reference in its entirety, for all purposes. - 

, • ' ■ . Other methods for obtaining recombinant polynucleotides and/or for 

obtaining diversity in nucleic acids used as the substrates for shuffling include, for example, 
, . ^ homologous recombination (PCT/US98/05223 ; Publ. No. W098/42727); 

10 directed mutagenesis (for review Smith, Ann, Rev. Genet, \9\ 423-462 (1985); Botstein 
'andShortle, Science 129: 1 193-1201 (1985); Carter. Biochem-jAZl: 1-7 (1986); Kunkel, ■ 
"The efficiency of oligonucleotide directed niutagenesis" in Nucleic acids & Molecular : - 
Biology, Eckstein and Lilley, eds., Springer Verlag, Berlin (1987)), Included among these • 
methods are oHgonucleotide-directed mutagenesis (Zolier and Smith; ;Vuc^^^ 

15' M%1'650Q {\9^2),^ Methods in EnzymolAQQ\ A6^'^^ 

154: 329-350 (1987)) phosphothioate-modifiedDNA mutagenesis (Taylor er a/.,. VwcA^ 
Res, 13: 8749-8764(1985); Taylor er a/., M/c/.'^c^i/^ Res, 13: 8765-8787 (1985); Nakamaye . • 

and Eckstein, Nucl Acids Res. 14: 9679.9698.(1986); Sayers et aL\ NucL Acids Res, 16: \ • 
. 791-802 (1988); Sayers era/:, A^uc/.^ezii5^^5. 16:803-814(1988)^^ ' ^ 

20 uracil-containing templates (Kunkel, Proc, A^a/7./ca^/. ; ■ , 

Kunkel et ai. Methods in EnzymoL 154: 367-382)); mutagenesis using gapped duplex DNA 
(Kramer era/,, Nucl. Acids Res. 12: 9441-9456 (1984); Kramer and Fritz,.Merfto£/5i>i 
Enzymoi 154: 350-367 (1987); Kramer er a/., A^wc/. .^c/Js i?e5. 16: 7207 (1988)); and Fritz er - 
. dl. Nucl Acids Res. 16: 6987-6999 (1988)). Additional suitable methods include point . 

■25 ' mismatch repair (Kramer er a/., Ce// 38: 879-887 (1984)), mutagenesis using regairr^^ 
host strains (Carter era/., A^wc/. iJe5, 13: 4431-4443 (1985); Carter,^ 
fnzymo/; 154: 38*2-403 (1987)), deletion mutagenesis (Eghtedarzadeh and Hem " 
Acids Res, 14: 51 15 (1986)), restriction-selecUon and restriction-purification (Wells er a/., 
Phil. Trans, R. Soc. Lond. A 317: 415-423 (1986)), mutagenesis by total gene synthesis 

30 (Nambiar er a/., Sc/e/ice 223: 1299-1301 (1984); Sakamar and Khorana//^^^ 

6361-6372 (1988); Wells era/., Gene 34: 315-323 (1985); and Grundstrom er a/., /Vwc/. 
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Acids Res. 13: 3305-3316 (1985). Kits for muiagenesis are commercially available (e.^., 
Bio-Rad, Amersham International .\ngiian Biotechnology). 

The breeding procedure stans with at least nvo substrates that generally show 
some degree of sequence identity to each other (/.e., at least about 30%, 50%, 70%, 80% or 

any t>qpe of mutation, for example, substitutions, insertions and deletions. Often, different 
segments differ from each other in about 5-20 positions. For recombination to generate 
increased diversity relative to the starting materials, the staning materials must differ from 
each other in at least two nucleotide positions. That is, if there are only two substrates, there 

10 should be at least two divergent positions. If there are three substrates, for example, one 
substrate can differ from the second at a single position, and the second can differ from the 
third at a different single position. The starting DNA segments can be natural variants of 
each other, for example, allelic or species variants. The segments can also be from 
^ nonallelic genes showing some degree of structural and usually functional relatedness (e.g., 
. 15 different genes within a superfamily, such as the family of Yersinia V-antigens, for 

example). The starting DNA segments can also be induced variants of each other. For 
example, one DNA segment can be produced by error-prone PCR replication of the other, 
the nucleic acid can be treated with a chemical or other mutagen, or by substitution of a 
mutagenic cassette. Induced mutants can also be prepared by propagating one (or both) of 

20 the segments in a mutagenic strain, or by inducing an error-prone repair system in the^cells. ' 
In these situations, strictly speaking, the second DNA segment is not a single segment but a 
. . l^g^.f?n}ily of related segments. The different segnents forming the starting materials are 
often the same length or substantially the same length. However, this need not be the case; 
for example; one segment can be a subsequence of another. The segments can be present as 

25 part of larger molecules, such as vectors, or can be in isolated form. 

The starting DNA segments are recombined by any of the sequence 
recombination formats provided herein to generate a diverse library of recombinant DNA 
segments. Such a librar>' can vary widely in size from having fewer than 10 to more than 
\0\ 10^ 10^^ or more members. In some embodiments, the starting segments and the 

30 recombinant libraries generated will include full-length coding sequences and any essential 
regulatory sequences, such as a promoter and polyadenylation sequence, required for 
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expression. In other embodiments, the recombinant DNA segments in the library can be 
inserted into a common vector providing sequences necessar\' for expression before 
performing screening/selection. 

Substrates for Evolution of Optimized Recombinant Antigens 
5 The invention provides methods of obtaining recombinant polynucleotides 

. that encode antigens that exhibit improved ability to induce an immune response to a 

■ 

pathogenic agent. The methods are applicable to a wide range of pathogenic agents, 
including potential biological warfare agents and other organisms and polypeptides that can 
cause disease and toxicity in humans and other anirnals. The following examples are mereiv 
10 illustrative, and not limiting. . . 

. ' 1. Bacterial Pathogens and Toxins 

In some embodiments, the methods of the invention are applied to bacterial ' 
pathogens, as well as to toxins produced by bactena and other organisms. One can use the 
methods to obtain recombinant polypeptides that can induce an immune response against the 

1 5 pathogen, as well as recombinant toxins that are iess toxic than native toxin polypeptides. 
Often, the polynucleotides of interest encode polypeptides that are present on the surface of 
the pathogenic organism. ^ . 

.\mong the pathogens for which the methods of thejnvention are useful for 
producing protective immunogenic recombinant polypeptides are the yer5m/a, species. 

20 Yersinia pestis\ the causative agent of plague, is one of the most virulent bacteria known 
with LD50 values in mouse of less than 10 bacteria. The pneumonic form of the disease is 
readily spread between humans by aerosol or infectious droplets and can be lethal within 
days. A particularly preferred target for obtaining a recombinant polypeptide that can protect 
^%d^nsi Yersinia infection is the V antigen, which is a 37 kDa virulence factor, induces 

25 protective immune responses and is currently being evaluated as a subunit vaccine (Brubaker 
(1991) Current Investigations of the Microbiology^ ofYersinae^ 12: 127). the V-antigen 
alone is not toxic, but Y, pestis isolates that lack the V-antigen are avirulent. The Yersinia V- 
antigen has been successfully produced in £. co// ,by several groups (Leary et al. (1995) 
Infect, Immun. 3: 2854). Antibodies that recognize the V-antigen can provide passive 

30 protection against homologous strains, but not against heterologous strains. Similarly, 

immunization with purified V antigen protects against only homologous strains. To obtain 
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quantities of cholera toxin can induce severe diarrhea causing loss of tens of liters of fluid. 
Cholera toxin is a complex of a single catalytic A subunit with a pentameric ring of identical 
B subunits. Each subunit is inactive on its own. The B subunits bind to specific gangiioside 
receptors on the surface of intestinal epithelial ceils and trigger the entry of the A subunit ■ 
'^into'the"cellrThe~A""subuni t~AD 
events causing a massive, sustained flow of electrolytes and water into the intestinal lumen 
resulting in extreme diarrhea. . 

The B subunit of cholera toxin is an attractive vaccine target for a number of 
reasons. It isa major target of protective antibodies generated during cholera infection and 
contains the epitopes for antitoxin neutralizing antibodies. It is nontoxic without the A 
subunit, is orally effective, and stimulates production of a strong IgA-dominated gut mucosal 
immune response, which is essential in protection against cholera and cholera toxin. The B 
subunit is also being in\'estigaied for use as an adjuvant in other vaccine preparations, and 
therefore, evolved toxms may provide general improvements for a variety of different 
vaccines. The heat-labile enterotoxins (LT) from enterotoxigenic E. coli strains are 
structurally related to cholera toxin and are 75% identical at the DNA sequence level. To 
obtain optimized recombinant toxin molecules that exhibit reduced toxicity and increased 
abihty to induce an immune response that is protective against V. cholerae and E. coli, the 
genes that encode the related toxins are subjected to DNA shuffling. 

The recombinant toxins are then tested for one or more of a several desirable 
traits. For example, one can screen for improved cross-reactivity of antibodies raised against 
the recombinant toxin polypeptides, for lack of toxicity in a cell culture assay, and for ability 
to induce a protective immune response against the pathogens and/or against the toxins 
themselves. The shuffled clones can be selected by phage display and/or screened by phage 
ELISA and ELISA assays for the presence of epitopes from the'different serotypes. Variant 
proteins with multiple epitopes can then be purified and used to' immunize mice or other test 
animal. The animal serum is then assayed for antibodies to the different B chain subtypes 
and variants that elicit a broad cross-reactive response will be evaluated further in a virulent 
challenge model. The £. coli and V. cholerae toxins can also act as adjuvants that are 
. capable of enhancing mucosal immunity and oral delivery of vaccines and proteins. 
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Accordingly, one can test the librarv' of recombinant toxins for enhancement of the adjuvant, 
activity. , ■ . , 

Shuffled antigens can also.be screened for improved expression levels and 
stability of the B chain pentamer, which may be less stable than when in the presence of the 
5 A chain in thehexameric complex. Addition of a heat treatment step or denaturing agents 
such as salts, urea, and/or guanidine hydrochloride can be included prior to ELISA assays to 
measure yields of correctly folded molecules by appropriate antibodies. It is , sometimes 
desirable to screen for stable monomeric B chain molecules, in an ELISA format, for ' 
. ,^,,.exarriple, using antibodies that bind monomeric, but not pentameric B chains. Additionally, 
1,0 the ability of shuffled antigens to ehcil neutralizing antibodies in . an appropriate animal 

model such as mouse or monkey can be screened. For example, antibodies that bind to the B 
chain and prevent its binding to its specific gangiioside receptors on the surface of intestinal 
epithelial cells may prevent disease. Similarly antibodies that bind to the B chain and prevent 
its pentamerization or block A chain binding may be useful in preventing disease. 
15 The bacterial antigens that can be improved by DNA shuffling for use as ■ 

vaccines also include, but are not limited to, Helicobacter pylori antigens CagA and Vac A 
XBlaser (1 996) Aliment. Pharmacol Ther. 1 : 73-7; Blaser and Crabtree (1 996) Am, 1 Clin.' 
Pathoi: 106: 565-7; Cmsini et at. (1996) Proc. Nai'i Acad. Sci. USA 93: 14648-14643). 
Other suitable //. /?v/c?rz antigens include, for example, four immunoreactive proteins of 45- . 
20 65 kDa as reported by Chatha era/. (1997) Indian J, Med. Res. 105: 170-175 and the.//. 
. pylori GroES homologue.(HspA) (Kansau et ai (1996) Mpi MicrobioL 22: 1013-1023, 
Other suitable bacterial antigens include, but are not limited to, the 43-kDa and the fimbrilin 
(41 kDa) proteins of P. gingivalis (Boutsl et ai (1996) Oral MicrobioL Immunol, 11 : 236- 
241); pneumococcal surface protein A (Briles et a/. (1996) Ann, NY Acad, Sci, 797: 118- ' 
25. 126); Chlamydia psittaci ani\gons, &0'9Q kDa protein and 1 10 kDa protein (Buendia er ai 
■ " (1991) FEMS MicrobioL Lett. A 50:1 13-9)\ the ch\^myd'm^ 

(Whittum-Hudson er al. {1996) Nature Med, 2: 11 16-1 121); Chlamydia pneumoniae S])ecits- 
specific antigens in the molecular weight ranges 92-98, 51-55, 43-46 and 31.5-33 kDa and 
genus-specific antigens in the ranges 12, 26 and 65-70 kDa (Halme et al. (1997) Scand. J. 
30 Immunol. 45: 378-84); Neisseria gonorrhoeae (GC) or Escherichia coli phase-variable 

opacity (Opa) proteins (Chen and Gotschlich (1996) Proc. Nat 'I. Acad. Sci. USA 93: 14851- 
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crass-protective recombinant V antigen, in a preferred embodiment, V antigen genes from 
various Yersinia species are subjected to family shuffling. The genes encoding the V antigen 
from K pestis, Y. enterocolitica, and^ Y, pseudotuberculosis, for example, are 92-99% 
identical at the DNA level, making them ideal for optimization using family shuffling 
5 ^~Ic^oixiing~tTthFmetlT^ 

acids is screened and/or selected for those that encode recombinant V antigen polypeptides 
that can induce an improved immune response and/or have greater cross-protectivity. 

Bacillus anthracis, the causative agent of anthrax, is another example of a 
bacterial target against which the methods of the invention are useftil. The anthrax protective 
10 antigen (PA) provides protective immune responses in test animals, and antibodies against 
PA also provide some protection. However, the immunogenicity of PA is relatively poor, so 
multiple injections are typically required when wild-type PA is iised. Co-vaccination with 
lethal factor (LF) can improve the efficacy of wild-t>pe PA vaccines, but toxicity is a 
limiting factor. Accordingly the DNA shuffling and antigen library immunization methods "'I 
15 of the invention can be used to obtain nontoxic LF. Polynucleotides that encode LF from 
various'5- anthracis strains are subjected to family shuffling. The resulting library of ^ 
recombinant LF nucleic acids can then be screened to identify those that encode recombinant 
LF polypeptides that exhibit reduced toxicity. For example, one can inoculate tissue culture j'"- 
cells with the recombinant LF polypeptides in the presence of PA and select those clones for v 
20 ' which the cells survive. If desired, one can then backcross the nontoxic LF polypeptides to jI 
retain the immunogenic epitopes of LF. Those that are selected through the first screen can 
'then be subjected to a secondary screen. For example, one can test for the ability of the 
recombinant nontoxic LF polypeptides to induce an immune response (e.^./CTL or antibody 
response) in a test animal such as mice. In preferred embodiments, the recombinant nontoxic 
25 LF polypeptides are then tested for ability to induce protective immunity in test animals 
, against challenge by different strains of 5.' anthracis. 

The protective antigen (PA) of B. anthracis is also a suitable target for the 
:methods of the invention. PA-encoding nucleic acids from various strains of B. anthracis. btq 
subjected to DNA shuffling. One can "then screen for proper folding in, for example, £. co/z, 
30 using polyclonal antibodies. Screening for ability to induce broad-spectrum antibodies in a 
. test animal is also typically used, either alone or in addition to a preliminary screening 
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method. In presently preferred embodiments, those recombinant polynucleotides that exhibit 
, the desired properties can be backcrossed so- that the immunogenic epitopes are maintained. 
Finally, the selected recombinants are tested for ability to induce protective immunity 
agamst different strains of 5. ^2/zr/irflc/5 in a test animal. 
5 The Staphylococcus aureus Strepiococcus toxins are another example of 

a target polypeptide that can be altered usmg the methods of the invention. Strains of , 
Staphylococcus aureus and group A Streptococci are involved in a range of diseases, 
including food poisoning, toxic shock s>Tidrome, scarlet, fever and various autoimmune 
. . , disorders. They secrete a variety of toxins, which include at least five cytolytic toxins, a . 
10 coagulase, and a variety of enterotoxins. The enterotoxins are classified as superantigens in 
that they crosslink MHC class II molecules with T cell receptors to cause a constitutive T 
cell activation (Fields ^z/. (\996) Nature l>iA: 188). This results in the accumulation of 
pathogenic levels of cytokines that can lead to muliipie organ/ailure and death. At least 
thirty related, yet distinct enterotoxins have been sequenced and can be phylogenetically 
15 grouped into -families. Crystal structures have been obtained for several members alone and 
■ • in complex with MHC class II molecules. Certain mutations in the MHC class Il-binding site, 
of the toxins strongly reduce their toxicity and can form the basis of attenuated vaccines 
(Woody eV a/. (1997) Vaccine 15; 133). However, a successful immune response to one type 

■ of toxin may provide protection against closely related family members, whereas little 

20 protection against toxins from the other families is observed. Family shuffling of enterotoxin 
genes from various family membeirs can be used to obtain, recombinant toxin molecules that 
. have reduced toxicity and can induce a cross-protective immune response. Shuffled antigens 
can also be screened to identify antigens that ehcit neutralizing antibodies in an appropriate 
/animal model such as mouse or monkey. Examples of such assays can include ELISA 

25 formats in which the elicited antibodies prevent binding of the enterotoxin to the MHC - . 

complex and/or T cell receptors on cells or purified forms. These assays can also include 
formats where the added antibodies would prevent T cells from being cross-linked to 

■ appropnaie antigen presenting cells. - . 

Cholera is an ancient, potentially lethal disease caused by the bacterium 
30 Vibrio cholerae and an effective vaccine for its prevention is still unavailable. Much of the 
pathogenesis of this disease is caused by the cholera enterotoxin. Ingestion of microgram 
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' 14856), any of the twelve immunodominant proteins oi Schistosoma mansoni (ranging in 
molecular weight from 14 to 208 kDa) as descnbed by Cutts and Wilson (1997) 
Parasitology' 1 14; 245-55; the 1 7-kJ3a protein antigen of Brucella abortus (De Mot et ai- 

(1996) Curr Microbiol. 33; 26-30); a gene homologof the 1 7-kDa protein antigen of the 
^S—^Gr^m-nQgnUvc-paihogm^Bnicella'abortusnd^ 

Rhodococcus sp. NI86/21 (De Mot et aL.{\996) Curr. Microbiol 33; 26-30); the 
staphylococcal enterotoxins (SEs) (Wood et ai (1997) F EMS Immunol Med. Microbiol 17; 
1-10), a 42-kDa M. hyopneumomae NrdF ribonucleotide reductase R2 protein or 15-kDa 
subunit protein of M hyopneumoniae (Fagan et al (1997) Infect. Immun. 65; 2502-2507), 

'10 the meningococcal antigen PorA protein (Feavers et al (1997) Clin. Diagn, Lab. Immunol 
3: 444-50); pneumococcal surface protein A (PspA) (McDaniel et al (1997) Gene Ther. 4: 
375-377); F. tularensis outer membrane protein FopA (Fulop et al (1996) FEMS Immunol 
Med. Microbiol 13; 245-247); the major outer membrane protein within strains of the genu5 
Actinobacillus (Hanmann et al (1996) Zentralbl Bakteriol 284: 255-262); p60 or . i: 

15 listeriolysin (Hly) antigen o( Listeria monocytogenes (Hess et al. (1996) Proc. Nat 7. Acad. 

Scl USA 93: 1458-1463); flagellar (G) antigens obser\*ed on Salmonella enteritidis and S. ^ 
pullorum (Holt and Chaubal (1997) J. Clin. Microbiol 35: 1016-1020); Bacillus anthracis 
protective antigen (PA) (Ivins et al (1995) Vaccine 13: 1779-1784); Echinococcus 
granulosus antigen 5 (Jones et al (1996) Parasitology 113: 213-222); the rol genes of 

20 Shigella dysenteriae 1 and Escherichia coli K-12 (Klee et al (1997) J. Bacteriol 179; 2421- ^ 
2425); cell surface proteins Rib and alpha of group B streptococcus (Larsson et al. (1996) 
Infect. Immun. 64: 3518-3523); the 37 kDa secreted polypeptide encoded on the 70 kb 
virulence plasmid of pathogenic Yersinia spp. (Leary et al (1995) Contrib. Microbiol 
Immunol 13: 216-217 and Roggenkamp et al (1997) Infect. Immun. 65: 446-51); the OspA 

25 (outer surface protein A) of the Lyme disease spirochete Borrelia burgdorferi (Li et al 

(1997) Proc. Nat 7. Acad. ScL USA 94: 3584-3589, Padilla et al. (1996) / Infect. Dis. 174: 
739-746, and Wallich et al (1996) Infection 24: 396-397); the Brucella melitensis group 3 
amigen gene encoding Omp28 (Lindler eM/. (1996/ /^/ec^ //wmwAz. 64:^ 2 

antigen of Streptococcus mutans (Murakami et al (1997) Infect. Immun. 65: 794-797); 
30 pneumolysin, Pneumococcal neuraminidases, autolysin, hyaluronidase, and the 37 kDa 

pneumococcal surface adhesin A (Paton et al (1991) Microb. Drug Resist. 3: 1-10); 29-32, 
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41-45, 63-71 X 10(3) MW antigens o( Salmonella typhi (Perez ei aL (1996) Immunology 89: 
262-267); K-antigen as a marker of Klebsiella pneumoniae (Priamukhina and Morozova 
(1996) Klin. Lab\ Diagn. Al'9)\ nocardial antigens of molecular mass approximately 60, 40,. 
20 and 15-10 kDa (Prokesova et aL (1996) InL J, ImmunopharmacoL 18: 661-668); 
5 Staphylococcus aureus antigen ORF-2 (Rieneck et al. (1 997) Biochim Biophys Acta 1 350: 
128-132); GlpQ antigen of Borreliahermsii(Schwan et aL (1996)7. Clin. Microbiol. 34: 
2483-2492); cholera protective antigen (CPA) (Sciortino (1996) J. Diarrhoeal his. Res. 14: 
16-26); a 190-kDa protein antigen of Streptococcus mutaris.(Senpuku al. (1996^ Oral 
MicrobioL ImmunoL 11: 121-1 28); Anthrax toxin protective antigen (PA) (Sharma et aL 
10 . (1996) Protein Expr. Purif. 7: 33-38); Clostridium perfringens antigens and toxoid (Strom et 
aL (1995) Br. 1 Rheumatol. 34: 1095-1096); the SEF14 fimbrial antigen of Salmonella 
enteritidis (Thorns ei aL (1996) Microb. Pathog. 20; 235-246); the Yersinia pestis capsular 
antigen (Fl antigen) (Titbail et-aL (1997) Infect. Immim. 65: 1926-1930); a 35-kiIodalion 
protein of Mycobacterium leprae (Triccas et al. ( 1 996yi Infect. Immun. -64: 5 1 71 -5 1 77); the 
15 , major outer membrane protein, CD, extracted from Moraxella (Branhamella) catarrhalis 
. (Yang et aL (1997) FEMS Immunol. Med MicrobioL 17: 187-199); pH6 antigen (PsaA 
protein) oJf Yersinia pestis (Zav'yalov et aL (1996) FEMS ImmunoL Med. MicrobioL 14: 53- 
57); a major surface glycoprotein, gp63, of Leishmania major (Xu and, Liew (1994) Vaccine 
12: 1534-1536; Xu and U^w i\99 5) Immunology 84: 173-176>; mycobacterial heat shock 
20 protein 65, mycobacterial antigen {Mycobacterium leprae hsp65) (Lowrie dl. (1994) * 
Vaccine 12: 1537-1540; Ragno et al, (1991) Arthritis Rheum. AO: 277-283; Silva(1995) 
Braz. J. Med. Biol. Res. 28: 843-851); Mycobacterium tuberculosis antigen 85 (Ag85) 
(Huygen et aL (1996) Nat. Med. 2: 893-898); the 45/47 kDa antigen complex (APA) of 
Mycobacterium tuberculosis, M. bovis and BCG (Horn et al. (1996) J. ImmunoL Methods 
25 ; 197: 151-159); the mycobacterial antigen, 65-kDa,heat shock protein, hsp65 (Tascon et al. 
(1996) Nat. Med. 2: 888-892); the mycobacterial antigens K1PB64, MPB70, MPB57 and 
alpha antigen (Yamada et aL (1995) Kekkaku 70: 639-644); the M. tuberculosis 38 kDa 
protein (Vordermeier et al. (1995) Vaccine 13: 1576-1582); the MPT63, MPT64 and MPT- 
59 antigens from Mycobacterium tuberculosis (Manca et al. (1997) Infect. Immun. 65: 16-' 
30 23; Oettinger et aL (1997) Scand 1 ImmunoL 45: 499-503; Wilcke et aL (1996) Tuber. Lung 
Dis. 77: 250-256); the 35-kilodalton protein of Mycobacterium leprae (Triccas et aL (1996) 
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Infect. Inimun. 64: 5171-5177); the ESAT-6 antigen of virulent mycobacteria (Brandt et aL 
(1996) y. Immunol, 157: 3527-3533; Pollock and Andersen (1997)7. Infect. Dis. 175: 1251- 
1254); Mycobacterium tuberculosis 16-kDa antigen-.(HspI6.3) (Change/ a/. (1996) J. Biol. 
Chem. 271: 72 1 8-7223); and the 1 8-kilodalton protein of Mycobacterium leprae (Baumgart 

~57^et'-al7{'l996y-lnfectrlmmunr6^4^^^^^^ ■ y— 

2. Vira! Pathogens v 
The methods of the invention are also useful for obtaining recombinant 
nucleic acids and polypeptides that have enhanced ability to induce an immune response 
against viral pathogens. While the bacterial recombinants described above are typically 

10 administered in polypeptide form, recombinants that confer viral protection are preferably 
administered in nucleic acid form, as genetic vaccines. 

One illustrative example is the Hantaan virus. Glycoproteins of this vims 
typically accumulate at the membranes of the Golgi apparatus of infected cells. This poor 
expression of the glycoprotein prevents the development of efficient genetic vaccines against 

15 these viruses. The methods of the invention solve this problem by performing DNA 

shuffling on nucleic acids that encode ^he glycoproteins and identifying those recombinants 
that exhibit enhanced expression in a host cell, and/or for improved immunogenicity when 
administered as a genetic vaccine, A convenient screening method for these methods is to 
express the recombinant polynucleotides as fusion proteins to PIG, which results in display 

20 of the polypeptides on the surface of the host cell (WTiitehom et aL (1995) Biotechnology (N 
Y) 13:1215-9). Fluorescence-activated cell sorting is then used to sort and recover those 
cells that express an increased amount of the antigenic polypeptidie on the cell surface„ This 
prehminary screen can be followed by immunogenicity tests in mammals, such as mice. 
Finally, in preferred embodiments, those recombinant nucleic acids are tested as genetic 

25 vaccines for their ability to protect a test animal against challenge by the virus. 

The flaviviruses are another example of a viral pathogen for which the 
methods of the invention are useful for obtaining a recombinant polypeptide or genetic 
vaccine that is effective against a viral pathogen. The flaviviruses consist of three clusters of 
antigenically related viruses: Dengue 1-4 (62-77% identity), Japanese, St. Louis and Murray 

30 Valley encephalitis viruses (75-82% identity), and the tick-borne encephalitis viruses (77- 
96% identity). Dengue virus can induce protective antibodies against SLE and Yellow fever 
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(40-50% identity), but few efficient vaccines are available. To obtain genetic vaccines and 
recombinant polypeptides that exhibit enhanced cross-reactivity and immunogenicity, the 
polynucleotides that encode envelope proteins of related viruses are subjected to DNA 
■ shuffling:' The resulting recombinant polynucleotides can be tested, either as genetic 
5 ' 'vaccines or by using the expressed polypeptides, for ability to induce- a broadly reacting' • 
neutralizing antibody response. Finally, those clones that are' favorable in the preliminary 
' screens can be tested for ability to protect a test animal against viral challenge. 

^ "• . ' . Viral antigens that can be evolved by DNA shuffling for improved activity as 
vaccines include, but are not lirhited to, influenza A virus N2 neuraminidase (Kilboume^r 
10 al^ (1995) Vaccine 13: 1799-1803); Dengue virus envelope (E) and premembrane (prM) 
antigens (Feighny^r a/. (1994) /iw.y. Trop. Med Hyg/sO: 322-328- Pulnak et aL (1996) 
4m. J. Trop. Med, Hyg. 55: 504-10); HIV antigens Gag, Pol, Vif and Nef (Vogt et a!. (1995) ' 
Vaccine 13: 202-208); FIIV antigens gpl20,and gpl60 (Achour c-/ a/. .(1995) Cell Mol. Biol. 
■ 41:'395-400; Hone et ai (1994) Dev. Biol. Stand: S2:. 159-162); gp41 epitope of human ■ 
.15 immunodeficiency virus (Eckhart el a/.- (1996) 7. Cen. Virol. 77; 2001-2008); rotavirus 
antigen- VP4 (Mattione/ ^/."^( 1995) /. Wrc?/...69: 5132-5 137); the rotavirus protein VP7 or 
VP7sc(Emsliee/a/. (1995)y. ViroL 69: \747'1754; Xu e(Ml. (1995) J. Gen/^ 
1971-1980);^herpes simplex virus (HSV) glycoproteins- gB, gC, gD,.gE, gG, gH', .and/gI ' 
{¥hck.et aL (1994) MedvMicrobiol-I^^ 183: 87-94 [Mattion, 1995]; Ghiasr^r 

20 * al (1995)' Invest Ophthalmol. Vis. ScL 36: 1352-1360; McLean et a/. (1994) J..Infect: Disi 
170: 1 100-1109); immediate-early protein ICP47 of herpes simplex vims-type (HSV- 1) 
.(Banks era/. (1994) F/ro/og>i 200; 236-245); immediate-early (IE). p 
and ICP4 of herpes simplex virus (Manickan et aL (1995) J. Virol, 69: 4711-4716); influenza 
virus nucleoprotein and hemagglutinin (Deck er a/. (1997) Vaccine 15: 71-78; Fu etaL 
25 ' (1997)7: Firo/;* 71: 2715-2721); B19 parvovirus capsid proteins VP! (Kawase er a/. (1995) • 
Virology 211: 359-366) or VP2 (Brown et aL (1994) Virology 198: 477-488); Hepatitis B 
.vims core and e antigen (Schodel et aL (1996) Intervirology 39: 104-1.06); hepatitis B 
surface antigen (Shiau and Murray (1997) J. Med ViroL 51: 159-166); hepatitis B,surface 
antigen fused to the core antigen of the virus (Id.); Hepatitis B virus core-preS2 particles 
30 (Nemeckova et aL (1996) Acta ViroL 40: 273-279); HBV preS2-S protein (Kutinova et aL 
(1996) Vaccine 14:^ 1045-1052); VZV glycoprotein I (Kutinova et aL (1996) Vaccine 14: 
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1045-1052); rabies virus glycoproteins (Xiang et ai (1994) Virology 199: 132-140; Xuane/ 
al. (1995) Virus Res. 36: 151-161) or ribonucleocapsid (Hooper era/. (1994) Proc. Natl. 
Acad, Sci. USA 91: 10908-10912); human c>nomegalovirus (HCMV) glycoprotein B (UL55) 
(Britt et ai (1 995) /. Infect. Dis. 171:1 8-25); the hepatitis C virus (HCV) nucleocapsid 
"pfoteirTin a secreted or a nonsecrete^Torm, or ¥sl~fusi'6n protein'^ithlhe middlF(pre^S2^^ 
and S) or major (S) surface antigens of hepatitis B virus (HBV) (Inchauspe et al. (1997) 
DNACellBiol. 16: 185-195; Major era/. (1995) 7. Virol. 69: 5798-5805); the hepatitis C 
virus antigens: the core protein (pC); El (pEl) and E2 (pE2) alone or as fusion proteins 
(Saito et ai (1997) Gastroenterology 1 12: 1321-1330); the gene encoding respiratory, 
syncytial virus fusion protein (F;FP-2) (Falsey and Walsh (1996) Vaccine 14: 1214-1218; 
Piedra et ai (1996) Pediatr: Infect. Dis. J. 15; 23-31); the and VP7 genes of rotaviruses 
(Choi et ai {\991) Virology^ 232: \29A2%\ i'ln et ai {\996) Arch. Virol. 141:2057-2076); 
the EI, E2, E3, E4, E5, E6 and E7 proteins of human papillomavirus (Browner al. (1994) 
K/ro/ogy 201: 46-54; Dilhier era/. (1995) Cancer Detect. Prev. 19: 381-393; Krul etal. 
(1996) Cancer Immunol. Immunother. 43: 44-48; Nakagawa er a/. (1997)7, Infect. Dis. 175: 
927-931); a human T-Iymphotropic virus type I gag protein (Porter et ai (1995)7. Med. 
Virol. 45: 469-474); Epstein-Barr virus (EBV) gp340 (Mackett et al. (1996) J. Med. Virol. 
50: 263-271); the Epstein-Barr virus (EBV) latent membrane protein LMP2 (Lee et ai 
(1996) Eur, J. Immunol. 26; 1875-1883); Epstein-Barr virus nuclear antigens 1 and 2 (Chen 
and Cooper (1996) J. Virol. 70: 4849-4853; Khanna et al. (1995) Virology 2U: 633-637);'^ 
the measles virus nucleoprotein (N) (Fooks et al, (1995) Virology 210: 456-465); and 
cytomegalovirus glycoprotein gB (Marshall et aL (1.994)7 Med. Virol, 43: 77-83) or 
glycoprotein gH (Rasmussen et al. (1 994) 7 Infect. Dis. 1 70: 673-677). 

3. Parasites , ^ 

Antigens from parasites can also be optimized by the methods of the 
invention. These include, but are not limited to, the schistosome gut- associated antigens 
CAA (circulating anodic antigen) and CCA (circulating cathodic antigen) in Schistosoma 
mansoni, S. haematobium or S. japonicum (Decider et al. {1996) Parasitology 1 12: 21-35); a 
multiple antigen peptide (MAP) composed of two distinct protective antigens derived from 
the parasite Schistosoma mansoni (Femi et al. (1997) Parasite Immunol, 19: 1-11); 
Leishmaniajpaimic_.suThcQj^ 
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third-Stage larval (L'3) antigens of L. loa (Akue ei ai (1997) /. Infect. Dis. 175: 158-63); the • \ , ' ^\ 
• ■ genes, TamsI-1 and Tamsl-2, encoding the , 30-and 32-lcDa niajor.n^ ' • ' 

of Theileria armulata (Ta)-(d'Olrveira cr/. .(l99.6) Ge/7e 172; 33 -39)\ Plasmodium 
falciparum merozpite surface antigen 1 or 2 fal-Yam'an et al. (l995), Trans. R, Soc. Trop, . . 
*5 . 'Med:Hyg\ 89: 55.5-;559;;Beck et aii\991)J. Infect Dis. 175: 92.1.-926; Rzepczyk et al. ' ' 

{\991)Jnfect. Immun. 65: 1098- 11 00); circunisporozoite (CS) proteip-based B-e'pitppes from, " , 
Plasmodium berghei, {????N?m))2znd Plasmodium y 

Z?erg/ie/ T-helper epitope KQIRJDSIT^^ ' • ■ 

NYVAC-Pf7 Qucod^d Plasmodium falciparum antigens derived from the sporozoite . > ^ 
10' ' (circumsporGZoite protein and sporozoite surface protein 2),' liver (liver stage antigen 1), 
blood (merozoite surface protein i , senne repeat antigen, and apical membrane antigen 1), 
and sexual (25-kDa sexual-stage antigeii) stages of the parasite life cycle were inserted into a ■ 
■ : single NYVAC genome to generate NYVAC-Pf7 (Tine et al. (.1996) hifect, Immun. 64: ; . 

3S33-38A4);' Plasmodium falciparum amizGn Pfs230 (Williamson ei a!-. (1996) MoL - , 
15 Biochem, ParasitoL 78:/l61-l69); Plasmodium falciparum- zp\Oci\ membrane antigen (.\MA- ' ' 

I) (Lai et al. {1996) Infect. Immun. 64: 1054-1059); Plasmodium falciparum proteins Pfs28 . ' : 
. and Pfs25 (Duffy and Kaslow (1997) Infect. Immun. 65; 1 109-1 113); Plasmodium - 
' /a/cz>an/w merozoite, surface protein, MSPl (Hui ei ai (1996) Infect. Immun. 64: 1502- 
1509); the malaria antigen P032 (Ahlborg a/. (1996) /m/TiuA^^^^^ 
20 Plasmodium falciparum trs'Xhxozyxt mtmhxdnc ^^^^ 

Acad. Sci.USA93\3A91-3S02\Bimxc\\etaLX\^^ * ' 

,* falciparum merozoite surface antigen; PfMSP-l (Egan er al. (1996) / Infect. Dis. 173; 765- 

769); Plasmodium falciparum antigens SERA, EBA-175, RAPI and"RAP2 (Riley (1997) J. \ 
Pharm. Pharmacol. 49: 21-27); Schistosoma japonicum pararnyosin (Sj97) or.fragments 
25 thereof (Yang -.e/ a/. (1995) Biochem. Biophys. Res. Commun. 2\2: 1029 A 039)\ andHsp70 , 
in parasites (Maresca and Kobayashi (1994) 50: 1067-1074). 

' 4. Allergy 

The invention also provides methods of obtaining reagents that are useful for 
treating allergy. In one embodiment, the methods involve making a library of recombinant 
30 polynucleotides that encode an allergen, and screening the library to identify those 
recombinant polynucleotides that exhibit improved properties when used as 
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immunotherapeutiG reagents for treating allergy. For example, specific immunotherapy of 
allergy using natural antigens carries a risk of inducmg anaphylaxis, which can be initiated 
by cross-linking of high-affinity IgE receptors on masi cells. Therefore, allergens that are not 
recognized by pre-existing IgE are desirable. The , methods of the invention provide methods 
l5y-whicfiTonFcan"o&tairr^ 

induction of broader immune responses, increased safety and efficacy. 

Synthesis of polyclonal and allergen-speci fic IgE requires multiple 
interactions between B cells, T cells and professional antigen-presenting cells (APC). 
Activation of naive, unprimed B cells is initiated when specific B cells recognize the 

10 allergen by cell surface immunoglobulin (sig). However, costimulatory molecules expressed 
by activated T cells in both soluble and membrane-bound forms are necessary for 
differentiation of B cells into IgE-secreting plasma ceils. Activation of T helper ceils 
requires recognition of an antigenic peptide in the context of MHC class II molecules on the 
plasma membrane of APC, such as monocytes, dendntic cells, Langerhans cells or primed B 

15 cells. Professional APC can efficiently capture the antigen and the peptide-MHC class II 
complexes are formed in a post-Golgi, proteolytic intracellular compartment and 
subsequently exponed to the plasma membrane, where they are recognized by T cell 
receptor (TCR)(\VTiiiton (1998) Curr. Top. Microbiol. ImnnwoL 232: 1-13). In addition, 
activated B cells e.xpress CD80 (B7-1) and CD86 (B7-2, B70), which are the counter 

20 receptors for CD28 and which provide a costimulatory signal for T cell activation resulting 
in T cell proliferation and cytokine synthesis. Since allergen-specific T cells from atopic 
individuals generallybelong to the Th2 cell subset, activation of these celis also leads to 
production of IL-4 and IL-13, which, together with membrane-bound costimulatory 
molecules expressed by activated T helper cells, direct B cell differentiation into IgE- 

25 secreting plasma cells. 

Mast cells and eosinophils are key cells in inducing allergic symptoms in 
target organs. Recognition of specific antigen by IgE bound to high-affinity IgE receptors on 
mast cells, basophils or eosinophils results in crosslinking of the receptors leading to 
degranulation of the cells and rapid release of mediator molecules, such as histamine, 

30 prostaglandins and leukotrienes, causing allergic symptoms. 
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Immunotherapy of allergic diseases currently includes hyposensibilization 
treatments using increasing doses of allergen injected to the patient. These treatments result 
skewing of immune responses towards ThI phenotype and increase the ratio of IgG/IgE 
antibodies specific for allergens. Because these patients have circulating IgE antibodies , 
5 • specific for the allergens, these treatments include significant nsk of anaphylactic reactions. 
In these reactions, free circulating allergen is recognized hy IgE molecules bound to high- 
affinity IgE receptors on , mast cells, and eosinophils., Recognition of the allergen results in 
crosslinking of the receptors leading to release of mediators, such as histamine, 
prostaglandins, and leukotrienes, which cause the allergic symptoms, and occasionally 

1 0 anaphylactic reactions. Other problems associated with hv'posensibilization include low^ 
efficacy and difficulties in producing allergen extracts reproducibly. 

The methods of the invention provide a means to obtain allergens that, when 
used in genetic vaccines, provide a means of circumventing the problems that have Hmited 
the usefulness of previously known hvposensibilization treatments. For example, by 

15 expressing antigens on the surface of cells, such as muscle cells, the risk of anaphylactic 
reactions is significantly reduced. Thi*^ can be conveniently achieved by using genetic 
vaccine vectors that encode transmembrane forms of allergens. The allergens can also be 
rnodified in such a way that they are efficiently expressed in transmembrane forms, further 
■ reducing the risk of anaphylactic reactions. Another advantage provided by the use of 

20 genetic vaccines for hyposensibilization is that the genetic vaccines can include cytokines ■ 
and accessory molecules which further direct the immune responses towards the ThI 
phenotype, thus reducing the amount of IgE antibodies produced and increasing the efficacy 
of the treatments. To fiarther ireduce IgE production, one can administer the shuffled ■ 
allergens using vectors that have been evolved to induce primarily IgG and IgM responses, 

25 with little or no IgE response (see, e.'g,, US Patent Application Sen No, 09/021,769, filed 
February 1 1, 1998). 

In these methods, polynucleotides encoding known allergens, or homologs or 
ft*agments thereof (e.g., immunogenic peptides) are inserted into DNA vaccine vectors and 
used to immunize allergic and asthmatic individuals. Alternatively, the shuffled allergens are 

30 expressed in manufacturing cells, such as £. coli or yeast cells, and subsequently purified 
and used to treat the patients or prevent allergic disease. DNA shuffling or other 
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recombination method can be used to obtain allergens that activate T cells but cannot indui 
anaphylactic reactions. For example, a librarv' of recombinant polynucleotides that encode 
allergen variants can be expressed in cells, such as antigen presenting ceils, which are than 
contacted with PBMC or T cell clones from atopic patients. Those library members that 
:^efficiently-activateTH'ceirs~frowthe~atbpi^^ti^ 
proliferation, or by cytokine svnthesis (e.g:, synthesis of IL-2, IL-4, IFN-y. Those 
recombinant allergen variants that are positive in the in vitro tests can then be subjected to //; 
vivo testing. 

Examples of allergies that can be treated include, but are not limited to, 
allergies against house dust mite, grass pollen, birch pollen, ragweed pollen, hazel pollen, 
cockroach, rice, olive tree pollen, fungi, mustard, bee venom. Antigens of interest include 
those of animals, including the mite (e.g., Dermaiophagoides pteronyssmus, 
Dermaiopbagoides farinae. Blomia iropicalis), such as the allergens der p 1 (Scobie et ai, 
{199A) Biochem. Soc. Trans. 22: 44SS\ Yssd et al. (1992) J, fmmunoi. 148: 73S-745), der p2 " 
(Chmet ai {\996) Gin. Exp. A!lergy 26: 829-837), der p3 (Smith and Thomas (1996) Clin. 
Exp, Allergy 26: 571-579), der p5, der p V (Lin et a! (1994) / Allergy^' Clin. Immunol. 94: 
989-996), der p6 (Bennett and Thomas (1996) Clin, Exp. Allergy 26: 1 150-1 154), der p 7 
(Shen et ai (1995) Clin. Exp. Allergy 25: 416-422), der f2 (Yuuki et ai (1997) Int. Arch. 
Allergy ImmunoL 112: 44-48), der f3 (Nishiyama a/. (\995) FEBS Lett. 377: 62-66), der ^' 
f7 (Shen et ai (1995) Clin. Exp. Allergy 25: 1000-1.006); Mag 3 (Fujikawa et ai (1996) MoL 
ImmunoL 33: 311-319). Also of interest as antigens are the house dust mile allergens Tyr p2 
(Eriksson et ai (1998) £ur. J. Biochem. 251: 443-447), Lep dl (Schimidl a/. (1995) FEBS 
Lett. 370: 11-14), and glutathione S-transferase (O'Neill et al. (1995) Immunol Lett. 48: 103- . 
107); the 25,589 Da, 219 amino acid polypeptide with homology with glutathione S- 
transferases (O'Neill et ai (1994) Biochim. Biophys, Acta, 1219: 521-528); Bio t 5 (Arruda 
et ai (1995) Int. Arch. Allergy Immunol. 107: 456-457); bee venom phospholipase A2 
(Carballido et al. {\99A) J. Allergy Clin. ImmunoL 93: 758-767; Jutel e/ al, (1995)7. 
ImmunoL 154: 4187-4194); bovine dermal/dander antigens BDA 1 1 (Rautiainen et al. (1995) 
J. Invest, Dermatol, 105: 660-663) and BDA20 (Mantyjarvi et aL (1996) J, Allergy Clin, 
ImmunoL 97: 1297-1303); the major horse allergen Equ cl (Gregoire et al. (1996)7. Biol. 
Chem. 271: 32951-32959); Jumper antM pilosula allergen Myr p I and its homologous 
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allergenic polypeptides Myr p2 (Donovan ai (\996) Biochenh Mol Biol. Int. 39: 877- 
885); 1-13, 14, 16 kD allergens of the mite /rc?/7:cato (Caraballo ^/ ^Z. (1996) y. 

, Allergy Clin. Immimol. 98:. 573-579); the cockroach allergens Bla g Bd90K (Helm etal,, 
■ (1996) J. Allerg}' Clin. ImmunoL 98: 1 72-80) and Bia g 2.(Arruda et aL (1995) J. Biol. 
Chem. 270: 19563-19568); the cockroach Cr-PI allergens QNn et a/. (1996) /. ^BioL Chenu ^ 
271: 17937-17943); fire ant venom allergen, Soi i 2 (Schmidt et aL (1996) J. Allergy Clin. y 
ImmunoL 98: 82-88); the insect Chironomus thumnu.mdi]oi allergen Chi t 1-9 (Kipp et ai 
(1996) Int. Arc/L Allergy ImmunoL WO: 348-353); dog allergen Can f lor cat allergen Fel d 
1 (Ingram et aL (1995) / Allergy Clin. Immunol. 96: 449-456); albumin, derived, for * • 
example, from horse, dog or cat (GouBran Botros et aL (1996) Immunology 88: 340-347): • 
deer allergens with the molecular mass of 22 kD, 25 kD or 60 kD (Spitzauer a/i (1997) 
Clin.' Exp. Allergy^ 27;. 196-200); and the 20 kd major^ allergen of cow (Ylonen et al. (1994) J: 
Allergy^ Clin. ImmunoL 93: S51-S5S). ' " 

. Pollen and grass allergens are aiso-useful in vaccines,,particularly aiier 
optimization of the antigen by the methods of the invention. Such allergens include, for 
example, Hor'v9 (Astwood and Hill (1996) Gene IS2: 53-62, Lig vl (Batanero a/. (1996) 
Clin. Exp. Allergy 26: 1401-1410); Lol p 1 (Muller et aL (1996) Int. Arch. Allergy ImmunoL 
109: 352-355), Lol p II (TsunboviniM aL (1995) Mol. ImmunoL 32: 505-513), Lol pVA, Lol 
PYB (Ong et aL (1995) MoL ImmunoL 32: 295-302), Lol p 9 (Blaher et aL (1996) J. Allergy^ 
Clin. ImmunoL 9S\ 124-132); Par ] I (Cost^ et aL (1994) FEBS Lett. 341: 182-186; Saliusto. 
et aL (1996) J. Allergy Clin. ImmunoL 97: 627-637), Par j 2,0101 (Duro et aL (1996) FEBS 
Lett. 399: 295-298); Betvl (Ydbcvet aL. (1996)7. BioL Chem. 271: 19243-19250), Bet v2 
(Rihse/a/. (1994) Int. Arch.^ Allergy ImmunoL 105: j 90-1 94); Dac g3 (Guerin-Marchand 
a/,.(1996) Mol: ImmunoL 33: 797-806); Phi p 1 (Petersen etaL (1995) 1 Allergy Clin. 
ImmunoL. 95: 9S7 -994), Phi p 5 (Muller et aL (:\996)Jnt: Archr Allergy Immunol: 109: 352. • 
355), Phi p 6 (Petersen et aL (1995) Int. Arch. Allergy ImmunoL 108: 55-59); Cry j l ,(Sone et 
aL (1994) Biochem. Biophys. Res. Commun. 199:. 6 19-625), Cry j II (Namba era/. (1994) 
FEBS Lett. 353: 124-128); Cor a 1 (Schenk et aL(\994) Eur. J. Biochem. ^214: 717-722); 
cyn dl (Smith et al. (1996)7. Allergy Clin. ImmunoL 98: 331-343), cyn d7 (Suphioglu etaL 
( 1 997) FEBS Lett. 402: 1 67-1 72); Pha a I and isoforms of Pha a 5 (Suphioglu and Singh 
(1995) Clin. Exp. Allergy 25: 853-865); Cha o 1 (Suzuki et aL (1996) MoL ImmunoL 33: 
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451-460); profilin derived, e.g, from timothy grass or birch pollen (Valenta et aL (1994) 
■ Biochenj. Biophys, Res. Commun, 199: 106-118); P0149 (Wu ei aL (1996) Plant MoL Biol. 
32: 1037-1042); Ory si (Xu cv/. (1995) G^/je 164: 255-259); and Amb a V and Amb t 5 
(Kim et aL (1996) MoL Immunol. 33: 873-880; Zhu et al. (1995; J. Immunol. 155: 5064'- 
5073). ~ ^ . " 

Vaccines against food allergens can also be developed using the methods of 
the invention. Suitable antigens for shuffling include, for example, profilin (Rihs et aL 
(1994) Int. Arch. Allergyy ImmunoL 105; 190-194); rice allergenic cDNAs belonging to the 
alpha-amylase/trypsin inhibitor gene family (Alvarez et al. (1995) Biochim Biophys Acta . 
1251: 201-204); the main olive allergen, Ole e l (Lombardero etal. (1994) Clin Exp Allergy 
24: 765-770); Sin a 1 , the major allergen from mustard (Gonzalez De La Pena et aL ( i 996) 
Eur J Biochem. 237: 827-832); par\'aibumin, the major allergen of salmon (Lindstrom etal. 
(1996) Scand. J. ImmunoL 44: 335-344); apple allergens," such as the major allergen Mai d 1 
(Vanek-Krebitz et aL (1995) Biochem. Biophys. Res. Commun. 214: 538-551); and peanut 
allergens, such as Ara h I (Burks^i?^ a/, (1995)7. Clin, invest. 96: 1715-1721). 

The methods of the invention can also be used to develop recombinant 
antigens that are effective against allergies to fungi. Fungal allergens useful in these 
vaccines include, but are not limited to, the allergen, Cla h III, of Cladosporium herbarum 
(Zhang et al. (1995)7. ImmunoL 154: 710-717); the allergen Psi c 2, a fungal eyclophilin, 
from the basidiomycete Psilocybe cubensis (Homer et aL (1995) ////. Arch. Allergy ImmunoL 
107: 298-300); hsp 70 cloned from a cDNA library of Cladosporium herbarum (Zhang et aL 
- (1996) a//2 -4//erg>'-26: ■88-95);4he 68 kD allergen-of-Pe/i:di/zuw notatum {Shcn et-aL , 

(1995) Clin. Exp. Allergy 26: 350-356); aldehyde dehydrogenase (,ALDH) (Achatz et al. 
{\995) Mol ImmunoL 32: 213-227); enolase (Achatz etal. (1995) MoA ImmunoL 32: 213- 
227); YCP4 (Id.); acidic ribosomal protein P2 (Id.). 

Other allergens that can be used in the methods of the invention include latex 
allergens, such as a major allergen (Hev b 5) from natural rubber latex (Akasawa et al. 

(1996) 7 Bioi Chem. 271: 25389-25393; Slater e/ a/. (1996)7 Biol. Chem. 271: 25394- 
25399). 

The invention also provides a solution to another shortcoming of vaccination 
as a treatment for allergy and asthma. While genetic vaccination primanly induces CDS" T 
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cell responses, induction of allergen-specific IgE responses is dependent on T cells 
and their help to B cells. TH2-type cells are particularly efficient in inducing IgE synthesis 
because they secrete high levels of IL-4, IL-5 and IL-i:\ which direct Ig isotype switching to 
IgE s>Tithesis. IL-5 also induces eosinophiiia. The methods of the invention can be used to ^ 
5 develop recombinant antigens that efficiently induce CD4^ T cell responses, and direct . . 
differentiation of these cells towards the ThI phenotype. 

5. Inflammatory and Autoimmune Diseases 

Autoimmune'diseases are characterized by immune response that attacks 
tissues or cells of ones own body, or pathogen-specific immune responses that also are 

10 harmful for ones own tissues .or cells, or non-specific immune activation which is harmfu?, 

for one's own tissues or cells. Examples of autoimmune diseases include, but are not iimired , 
to, rheumatoid arthritis, SLE. diabetes melliius, myasthenia gravis, reactive arthritis, 
ankylosing spondyliiis. and multiple sclerosis. These and other infiammaton.^ conditions, 
including IBD, psoriasis, pancreatitis, and various immunodeficiencies, can be treated using 

15 antigens that are optimized using the methods of the invention. 

These conditions are often characterized by an accumulation of inflammator)' 
cells, such as lymphocytes, macrophages, and neutrophils, at the sites of inflammation. • . 
Altered cytokine production levels are often observed, with increased levels of cytokine 
production.' Several autoimmune diseases, including diabetes and rheumatoid arthritis, are - 

20 linked to certain MHC haplotypes. Other autoimmune-type disorders, such as reactive 
arthritis, have been shown to be triggered by bacteria such as Yersinia md Shigella, and 
evidence suggests that several other autoimmune diseases, such. as diabetes, multiple 
• sclerosis, rheumatoid arthritis, may also be initiated by viral or bacterial infections in 
genetically susceptible individuals. 

25 Current strategies of treatment generally include anti-inflammatory drugs, ' ' . / 

such as NSAID or cyclosporin, and antiproliferative drugs, such as methotrexate. These 
therapies are non-specific, so a need exists for therapies having greater specificity, and for 
means to direct the immune responses towards the direction that inhibits the autoimmune , 
process. 

30 The present invention provides several strategies by which these needs can be 

fulfilled. First, the invention provides methods of obtaining antigens having greater 
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tolerogenicity and/or have improved antigenicity. In a preferred embodiment, the antigens 
prepared according to the invention exhibit improved induction of tolerance by oral delivery. 
Oral tolerance is characterized by induction of immunological tolerance after oral 
administration of large quantities of aniige-;^. In animal models, this approach has proven to - 

—be^a-very-promising-approachnonreat-autoimmune-diseasesrand^clinicahtriais-arein^progress 
to address the efficacy of this approach in the treatment of human autoimmune diseases, 
such as rheumatoid anhritis and multiple sclerosis. It has also been suggested that induction 
of oral tolerance against viruses used in gene therapy might reduce the immunogenicity of 
. gene therapy vectors. However, the amounts of antigen required for induction of oral 
tolerance are very high and the methods of the invention provide a means for obtaining 
antigens that exhibit a significant improvement in induction of oral tolerance. 

Expression librar>' immunization (Barry et- ai ( 1 995) Nature 377: 632) is a 
particularly useful method of screening for optimal antigens for use in genetic vaccines. For 
example, to identify autoantigens present in Yersinia, Shigella, and the like, one can screen 
for induction of T cell responses m HLA-B27 positive individuals. Complexes that include 
epitopes of bacterial antigens and MHC molecules associated with autoimmune diseases, 
e,g., HLA-B27 in association with Yersinia antigens can be used in the prevention of 
reactive arthritis and ankylosing spondylitis in HLA-B27 positive individuals. 

Screening of optimized antigens can be done in animal models which are 
known to those of skill in the an. Examples of suitable models for various conditions 
include, collagen induced anhntis, the NTS/sld mouse model of human Sjogren's syndrome; 
a 120 kp organ-specificautoantigen recently identified as an analog of human cytoskeletai 
protein (a-fodrin (Haneji et al, (1997) Science 276: 604), the New Zealand BlackAVhiie F i 
hybrid mouse model of human SLE, NOD mice, a mouse model of human diabetes meliitus, 
fas/fas ligand mutant mice, which spontaneously develop autoimmune and 
lymphoproliferative disorders (Watanabe-Fukunaga er ai (1992) Nature 356: 314), and 
expenmental autoimmune encephalomyelitis (EAE), in which myelin basic protein induces a 
disease that resembles human multiple sclerosis. 

Autoantigens that can be shuffled according to the methods of the invention 
and used in vaccines for treating multiple sclerosis include, but are not limited to, myelin 
basic protein (Stinissen et ai (1996) 7. Neurosci. Res. 45: 500-51 1) or a fusion protein of 
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myelin basic protein and proieoiipid protein (Elliott e/ al. (1996) J, Clin. Invest. 98: 1602- 
1612), proteolipid protein (PLP) (Rosener et ai (1997) J. Neuroimmunol. 75: 28-34), 2',3'- 
cyclic nucleotide 3'-phosphodiesterase (CNPase) (Rosener et al. (1997) J. Neuroimmunol. 
75: 28-34), the.Epstein Barr virus nuclear antiren-l (EBNA-1) (Vaughan et al. (1996) / ' 
5 Neuroimmunol. 69: 95-102), HSP70 (Salvetti et ai (1996) J. Neuroimmunol. 65: 143-53; 
Feldmann^/^/. (1996) Ce//85:307). ^ ■ : ' < 

Target antigens that, after shuffling according to the methods of the invention, 
can be used to treat scleroderma, systemic sclerosis, and systemic lupus erythematosus 

include, for example, (-2-GPI, 50 kDa glycoprotein (Blank et at. (1994) J. Autoimmun. 7: 

10 441-455), Ku (p70/p80) autoantigen, or its 80-kd subunit protein {Hong et al. {\994) Invest, 
bphthalmoi .Vis. Sci. 35: 4023-4030; Wzngetal. {1994)1 Cell Sci, 107: 3223-3233), the 
nuclear autoantigens La (SS-B) and Ro (SS-A) (Huang et al. (1997) J. Clin. Immunol 17: 
212-219; Igarashi a/. (1995) Autoimmunity 22: 33-42: Keech'ei al (1996) Clin. Exp. 
Immunol. 104: 255-263; Manoussakis et al. (1995) J. .Autoimmun. S: 959-969; Topfer al. 
15 (1995) Proc. Nat 7. AcacL Sci. USA 92: 875-879), proteasome (-t>T5;e subunit C9 (Feist.e/ al. 
(1996) J. Exp. Med. 184: 1313-1318), Scleroderma antigens Rpp 30, Rpp 38 or Scl-70 (Eder 
et al. (1997) Proc. Nat 'I. Acad Sci. USA 94: 1 101-1106;'Hietarintaer a/. (1994) Br. J, ■ 
Rheumatol. 33: 323-326); the centrosome autoantigen PCM-1 (Baoer al. (1995) 
Autoimmunity 22: 219-228), polymyositis-scleroderma autoantigen (PM-Scl) (Kho et al. 
20 (1997) J. Biol. Chem. 272:- 13426-13431), scleroderma (and other systemic autoimmune 

disease) autoantigen CENP-A (Muro et ai (1996) Clin. Immunol. Immunopathol. 78: 86-89), 
U5, a small nuclear ribonucleoprotein (snRNP) (Okano et^l. (1996) Clin. Immunol. 
Immunopathol. ^\: 41-47), the 100-kd protein of PM-Scl autoantigen (Ge era/. (1996) 
Arthritis Rheum. 'i9: 1588-1595), the nucleolar U3- and Th(7-2) ribonucieopfoteins 
25 (Verheijen et al. (1994) J, Immunol Methods 1 69: 173- 182), the ribosomal protein L7 (Neu 
, et al. (1995) Clin. Exp. Immunol 100: 198-204), hPopl (Lygerou et al (1996) EMBO J, 15: 

5936-5948), and a 36-kd protein from nuclear matrix antigen (Deng al (1996) Arthritis 
' Rheum. 39:, 1300-1307). 

' Hepatic autoimmune disorders can also be treated using improved 
30 recombinant antigens that are prepared according to the methods described herein. Among 
the antigens that are useful in such treatments are the cytochromes P450 and UDP- 
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gliicuronosyl-transferases (Obermayer-Straub and Manns (1 996) Baillieres Clin. 
Gastroenterol. 10: 501-532), the cytochromes P450 2C9 and P450 1A2 (Bourdi etal. (1996) 
Chem. Res, Toxicol. 9: ! 159- 1 1 66; Clemente et al. (1997) J. Clin. Endocrinol Metab. 82; 
1353-1361), LC-1 antigen (Klein et aL (1996)7. Pediatr. Gastroenterol Nutr. 23: 461-465), 
5 and a 230-kDa Golgi-associated protein (Funaki et al (1996) CelTStruct, Fimct^ TiTe^-fl). 

For treatment of autoimmune disorders of the skin, useful antigens include, 
but are not limited to, the 450 kD human epidermal autoantigen (Fujiwara et al (1996)7. 
Invest, Dermatol 106: 1 125-1 130), the 230 kD and 180 kD bullous pemphigoid antigens 
(Hashimoto {\995)KeioJ. Med. 44: 1 15-123; Murakami etal (1996)7 Dermatol Sci. 13: 
10 1 12-1*17), pemphigus foliaceus antigen (desmoglein 1), pemphigus vulgaris antigen 
(desmogiein 3), BPAg2, BPAgl, and type VII collagen (Batteux et al (1997) 7 Clin, 
Immunol 17: 228-233; Hashimoto etal (1996)7 Dermatol Sci, 12: 10- IT), a i68-kDa 
mucosal antigen in a subset of patients with cicatncial pemphigoid (Ghohestani et al (1996) * 
7 Invest. Dermatol, 107: 136-139), and a 218-kd nuclear protein (21S-kd Mi-2) (Seeiig etal:'' 
15 (1995) Arthritis Rheum. 3S: 1389-1399). '■ 
The methods of the invention are also useful for obtaining improved antigens '''' 
for treating insulin dependent diabetes mellitus, using one or more of antigens which 
include, but are not limited to, insulin, proinsulin, 0.-^065 anc G.\D67, heat-shock protein 
65 (hsp65), and islet-cell antigen 69 (ICA69) (French et al (1997) Diabetes 46: 34-39; Roep 
20 (1996) Diabetes 45: 1147-1 156; Schloot et al (1997) Diabetologia 40: 332-338), viral 
proteins homologous to GAD65 (Jones and Crosby (1996) Diabetologia 39: 1318-1324), 
islet cell antigen-related protein-tyrosine phosphatase (FTP) (Cui et a/. (1996) J. Biol Chem. 
271: 24817-24823), GM2-1 ganglioside (Cavallo a/. (1996) 7 Endocrinol, 150: 113-120; 
Dotta et al (1996) Diabetes 45: 1 193-1 196), glutamic acid decarboxylase (GAD) (Nepom 
25 (1995) Curr. Opin. Immunol 7: 825-830; Panina-Bordignon et al. (1995) 7 Exp. Med. 181 : 
1923-1927), an islet cell antigen (ICA69) (Karges et aL (1997) Biochim. Biophys, Acta 1360: 
' 97-101; Roep et al (1996) Eun 7 Immunol. 26: 1285-1289), Tep69, the single T cell epitope 
recognized by T cells from diabetes patients (Karges et al. (1997) Biochim. Biophys, Acta 
1360: 97-101), ICA 512, an autoantigen of type I diabetes (Solimena a/. (1996) EMBOJ. 
30 15: 2102-21 14), an islet-cell protein tyrosine phosphatase and the 37-kDa autoantigen 

derived from it in t y pe 1 diabetes (includin g IA-2, IA-2) (La Gasse et al (1997) Mol M ed. 
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3:-163-173);the 64 IcDa protein from In-Ill cells or human thyroid follicular.cells.that is 
immunoprecipitated with sera from patients with islet cell surface antibodies (ICSA) (Igawa 
et a/; (1996) Endocr / 43: 299-3*06), phogrin, a homologue of the human^ transmembrane 
/protein tyrosine phosphatase, an autpantigen of type. I diabetes (Kawasaki er a/. (1,99)5) -." 
Biochem, Biophys. Res, Commwu 227:.440-447), the 4,0 kDa and 37 kDa tryptic fragments 
and their precursors IA-2 and IA-2 in IDDM.(Lampasona er qi {1996) X Immunol: 157: • 
2707-2711; Notkins a/. (1996) J. Autoimmun. 9: 677-682), insulin or a cholera toxoid- 
insulm conjugate (Bergeroi et al: (1 997) Proc. Nat 1 Acad Sci. USA 94: 46 1 0-46 14), " " 
carboxypeptidase H, the human homologue of gp330, which is a renal epithelial . ; • 
glycoprotein involved in inducing Hevmann nephritis in rats, and the 38-kD islet , 
mitochondrial autoantigen (Arden et al. (1996) 7. Clin. Invest, 97:.55h-56L 

Rheumatoid arthntis is, another condition that is treatable using optimized ■. 
antigens prepared according to the present invention. Useful antigens for rheumatoid arthritis' 
.'treatment include, but are not limited to, the 45 kDa DEK nuclear antigen, in panicular onset 
juvenile rheumatoid arthntis and iridocyclitis (Murray er ^2/. (1997) J. Rheumatol. 24: 560- 
567), human cartilage giycoprotein-39, an autoantigen in rheumatoid arthritis .(Verheijden et 
,aL (1997) Arthritis Rheum, 40: 1115-1 125), a 68k: autoantigen in rheumatoid arthritis (Blass 
et al. {'1997) Ann. Rheum. Dis, 56: 317-322), collagen (Rosloniec al. (1995)1 Immunol. 
155: 4504-451 1), collagen type II (Cook et aL (1996) Arthritis Rheum. 39: 1720-1727; 
Trentham (1996) Ann. N: Y. Acad. 5c/., 778: 306-314), cartilage link protein. (Guerassimov er 
a/.- (1997) J. Rheumatol. 24: 959-964), ezrin, radixin' and moesin, which are auto-immune 
antigens in rheumatoid arthritis (Wagatsuma e^ al, (1996) Mol Immunol. 33: 1171-1176), 
and mycobacterial heat shock protein'65 (Ragno er al. (1997) Arthritis Rheum. AO: 277-283). 

Also amoiig the conditions for which one can obtain an iniproved aiiti^^^ 
suitable for treatment are autoimmune thyroid disorders. Antigens that are useful for these 
applications include, for example, thyroid peroxidase and the thyroid stimulating hormone 
receptor (Tandon and Weetman (1994) J. iJ. Coll. Physicians Lond, 2%: 10-18), thyroid 
peroxidase from human Graves' thyroid tissue (Gardas et aL {\997) Biochem.. Biophys, Res. 
Commun. 234: 366-370; Zimmer et al, (1997) Histochem. Cell Biol. 107: 1 15-120), a 64- 
kDa antigen associated with thyroid-associated ophthalmopathy (Zhang et al: (1996) Clin, 
Immunol Immunopathol. 80: 236-244), the human TSH receptor (Nicholson aL (1996) J. 
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MoL EhdocrinoL 16: 1 59-170), and the '64 kDa.protein from In-1 1 1 cells or human thyroid 
follicular cells that is immunoprecipitated with sera from patients with islet cell surface 
antibodies (ICSA) (Ig'awaer a/. ( 1996) , • ' " ' 

— Other conditions and associated antigens include, but are not:limited to, 

Sjogren's syndrome (^fodnnTHaheji et al7{ X991TScience 275r604H507), myastenia gravis 
(the human M2 acetylcholine receptor or fragments thereof- specifically the second 
extracellular loop of the human M2 acetylcholine receptor; Fu et al. (1996) Clin. Immunol. 
Immunopathol: 78; 203-207), vitiligo (tyrosinase; Fishman et aL (1 997) Cancer 79: 146 1 - 
•1464), a 450,lcD human epidermal autoantigen recognized by sefozm .from individual, with 
10 ' blistering skin disease, and ulcerative cohtis (chromosomal proteins HMGl and HMG2: 

Sob^imdi etaL{\99'^) CI m. Exp. Immunol A^O^ ' . . ' 

6. ■ Cancer . ' ■ • ' . ' 

Immunotherapy has great promise for ihe treatment of cancer and prevention , 
\ of metastasis. , By inducing an immune response against cancerous cells, the body's immune ■ 
15 system can be enlisted- to reduce or eliminate cancer. Improved antigens obtained using the 

methods of the invention provide cancer imrnunotherapies of increased etTectiveness " ' 

compared to those that are presently available. ' ■ V. ' ' , ' 

•. , . One approach to cancer irnmunotherapy' is vaccination using vaccines that ■ ' 

include or encode antigens that are specific for tumor cells or by injecting the patients with 
20 purified recombinant cancer antigens. The methods of the invention can be used for - • ' 
■ obtaining antigens that exhibit an enhancement of immune responses against known tumor- 

- -specific ■antigens,-and-also-'to -search for novel-protective antigenic sequencesr:Antigens - - - - 

having optimized expression, processing, and presentation can be obtained as described 
herein. The approach used for each particuiar cancer' can vary'. For treatment oi hormone- /, 
25 ^ sensitive cancers (for example, breast cancer and prostate cancer), methods of the invention 
can be used to obtain optimized hormone antagonists. For highly immunogenic tumors, • 
including melanoma/one can screen for recombinant antigens that optimaUy b 
inlmune response against the tumor. Breast cancer, in contrast, is of relatively low 
immunogenicity and exhibits slow progression, so individual treatments can be designed for 
30 each patient. Prevention of metastasis is also a goal in design of cancer vaccines. 



SUBSTITUTE SHEET (RULE 26) 



wo 99/41383 ^ ' PCT/US99/02944 

44 . 

■ - * Among the tumor-specific antigens that can be used in the antigen shuffling 

methods of the invention are: bullous pemphigoid antigen 2, prostate mucin antigen (PMA) 
(Beckett and Wright (l,995) //lA 7, Cancer 62: 703-710), tumor associated Thomsen- 
Friedenreich antigen (Dahlenborg er a/. H 997) Int: J. Cancer 70: 63-71), prostate-specific _ 
5 antigen (PSA) (Dannull and Belldegrun (1997) Br J, Urol. 1: 97-103), luminal epithelial ; 
antigen (LEA.135> of breast carcinoma and bladder transitional cell carcinoma/TCC) (Jones 
et ah{\991) Anticancer Res, 17: 685-687),, cancer-associated serum antigen (GASA) and 
cancer antigen' 125 (C A 125) (Kierkegaard a/., (1995) 

epitheliaf glycoprotein 40 (EGP40) (Kievit et aL, {1997} Int. J. Cancer 71 : 237.-245),' 
10' squamous cell carcinoma antigen ■(SGC) (Lozza et ai (1997) Anticancer Res. 17: 525-529), 
cathepsin E (Mota er^a/. {\997)-AnL J. PaihoL. \50: ,1212 A229), tyrosinase in melanoma * 
(Fishman et ai, {\991}Cancer 79: 1461-1464), cell nuclear antigen (PCNA) of.cerebral 
cavemonias "(Notelet et ai (1 991) Surg. NeuroL 47: 364-370), DF37^[UC 1 breast cancer 
antigen (Apostolopoulos et ai. (1996) ImmunoL Celi Biol, 74: 457-464; Pandey et al. (1995) 
15 Cancer iRe^.. 55:: 4000-4003), carcinoembryonic antigen (Paone et al. (1996) J. Cancer Res. 
Clin: OncoiAll: 499-503; Schlom et al. (1996) Breast Cancer Res. Treat. 38: 27-39), ' 
tumor-associated antigen CA 19-9 (Tolliver and 0'BrienX1997) South Med. J.-9Q: 89-90; 
Tsuruta et^ai (.1997) Urol, Int. 58: 20-24), human melanoma antigens MART-1/Melan-A27- 
. ■ ' 35 and gplOO (Kawakami and Rosenberg (1'997) Int. Rev. ImmunoL 14: '173-192; Zajac et al. 
20 .(1997) Int, /.Ca/icer 71 :.49 1-496), the T and Tn pancarcinoma (CA) glycopeptide epitopes ' 
' (Springer (1995) Crit, Rev. Oncog. 6: 57-85), a 35'kD timior-associated autoantigen in 
' papillary thyroid carcinoma (Lucas et ai (1996) Anticancer Res. 16: 2493-2496), KH-.l 
•adenocarcinoma antigen (Deshpande'and Danishefsky {1997) Nature 387: 164-166), the A60 
mycobacterial antigen (Maes et ai (1996) J. Cancer Res. Clin. Oncol. 122: 296-3 00),- heat 
' " 25 shock proteins (HSPs) (Blachere arid Srivastava ( 1995) Cancer Biol. 6: 3A9-355), md 

MAGE, tyrosinase, melan-A and gp75 and mutant oncogene products {e.g., p53, ras, and 
HER-2/neu (Bueler and Mulligan (1996) Moi Med. 2: 545-555; Lewis and Houghton (1995) 
Semin. Cancer Biol. 6: 321-327; Theobald et al, (1995) Proc. Nat 7. Acad, Sci. USA 92: - 
11993-11997), 
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7. Contraception 

Genetic vaccines that contain optimized antigens obtained by the methods of 
the invention are also usef^jl for contraception. For example, genetic vaccines can be 
obtained that encode sperm cell specific antigens, and thus induce anti-sperm immune 
"5~~~responses~Vaccination-can-be achieved-byrfor-exampleradministration-of^ 



bacterial strains, e^. Salmonella and the like, which express sperm antigen, as well as by 
induction of neutralizing anti-hCG antibodies by vaccination by DNA vaccines encoding 
human chorionic gonadotropin (hCG), or a fragment thereof 

Sperm antigens which can be used in the genetic vaccines include, for 

10 example, lactate dehydrogenase (LDH-C4), galactosyltransferase (GT), SP-10, rabbit sperm 
autoantigen (RSA), guinea pig (g)PH-20, cleavage signal protein (CS-1), HSA-63, human 
(h)PH-20, and AgX-1 (Zhu and Naz (1994) Arch. AndroL 33: 141-144), the synthetic sperm 
peptide, PlOG (O'Rand et ai (1993)7. Reprod. ImmunoL 25: 89-102), the 135kD.. 95kD, 
65kD, 47kD, 41kD and 231cD proteins of sperm, and the FA-1 antigen (Naz et ai (1995) 

15 Ardh Androl. 35: 225-231), and the 35 kD fra^men: of cmkeratin 1 (Lucas et al. (1996) 
Anticancer Res. 16: 2493'2496). 

The methods of the invention can also be used to obtain genetic vaccines that 
are expressed specifically in testis. For example, polynucleotide.sequences that direct 
expression of genes that are specific to testis can be used {e.g., fertilization antigen- 1 and the 

20 like). In addition to sperm antigens, antigens expressed on oocytes or hormones regulating 
reproduction may be useful targets of contraceptive vaccines. For example, geneiic vaccines 
can be used to generate antibodies against gonadotropin releasing hormone (GnRH) or zona 
pellucidaproteins(Miller^r a/. (1997) Vaccine 15:1858-1862). Vaccinations using these 
molecules have been shown to be efficacious in animal models (Miller et ai (1997) Vaccine 

25 15:1858-1862). Another example of a useful component of a genetic contraceptive vaccine 
is the ovarian zona pellucida glycoprotein ZP3 (Tung et ai (1994) Reprod, Fertil. Dev. 
6:349-355). 

Methods of Selecting and Identifying Optimized Recombinant Antigens 

Once one has performed DNA shuffling to obtain a library of polynucleotides 
30 that encode recombinant antigens, the library is subjected to selection and/or screening to 
jdenti fy-thQse-lihrary-memberS-that-encode-antigenic_peptid.e$_that have impr oved abilit y to 
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induce an inomune response to the pathogenic agent.' Selection and screening of recombinant 
polynucleotides that encode poKpeptides having an improved ability to induce an immune 
■ response can involve either in vivo and in vitro methods, but most often involves a 
.combination of these methods. For example, in a typical embodiment the members of a 
5 library of recombinant nucleic acids are picked, either individually or as pools. The clones 
can be subjected to analysis directly, or can be expressed to produce the corresponding ' 
. .polypeptides. In a presently preferred embodiment, an in viiro screen is performed to 
identify the best candidate sequences for the in vivo studies. Alternatively, the library can be 

• subjected to in vivo challenge studies directly. The ahalyses can employ either the nucleic 
10 acids themselves (eg., as genetic vaccines), or the polypeptides encoded by the nucleic 

acids. A schematic diagram of a tvpical strategy is shown in Figure 5 . Both in vitro and in 

• v/,vo methods are described in more detail below. 

If a recombination cycle is performed in vitro, the products of recombination, 
i.e., recombinant .segments, are sometimes introduced into cells before the screening step. 
1 5 ' Recombinant segments can also be linked to an apprcpnaie vector or other regulators- 
sequences before screening. Alternatively, products of recombination generated in vitro are 
sometimes packaged in viruses (e.g., bacteriophage) before screening. If recombination is 
. performed in vivo, recombination products can sometimes be screened in the cells in which 
recombination occurred. In other applications, recombinant segments are extracted from the- 
20 cells, and optionally packaged as viruses, before screening. 

Often, improvements are achieved after one round of recombination and ' 
selection. However, recursive sequence recombination can also be employed to achieve still 
further improvements in a desired property, or to bring about hew (or "distinct") properties. 
Recursive sequence recombination entails successive cycles of recombination to generate 
'25 ' molecular diversity . That is, one creates a family of nucleic acid molecules, showing some 

sequence identity to each other but differing in the presence of mutations. In any given cycle, 
recombination can occur in vivo or in vitro, intracellularly or extracellularly. Furthermore, 
diversity resulting from recombination can be augmented in any cycle by applying prior 
methods of mutagenesis {e.g., error-prone PCR or cassette mutagenesis) to either the 
30 substrates or products for recombination. 
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In a presently preferred embodiment, polynucleotides that encode optimized 
recombinant antigens are subjected to molecular backcrossing, which provides a means to 
breed the shuffled chimeras/mutants back to a parental or wild-tvpe sequence, while 
retaining the mutations that are critical to the phenotype that provides the optimized immune 
5 responses. In addition to removing the neutral mutations, molecuIaFblckcrossing caiTalso 
be used to characterize which of the many mutations in an improved variant contribute most 
to the improved phenotype. This cannot be accomplished in an efficient library fashion by 
any other method. Backcrossing is performed by shuffling the improved sequence with a 
large molar excess of the parental sequences. 

1 0 The nature of screening or selection depends on what property or 

characteristic is to be acquired or the property or characteristic for which improvemeni is 
sought, and many examples are discussed below. It is not usually necessar>' to understand 
the molecular basis by which panicular products of recombination (recombinant segments) 
have acquired new or improved properties or charactenstics relative to the starting 

15 substrates. For example, a gene thai encodes an antigenic polypeptide can have many 
component sequences each having a different intended role (see, e.g., Figure 4). Each of 
these component sequences can be varied and recombined simultaneously. Screening/ 
selection can then be performed, for example, for recombinant segments that have increased 
ability to induce an immune response to a pathogenic agent without the need to attribute 

20 such improvement to any of the individual component sequences of the recombinant 
polynucleotide. 

Depending on the particular screening protocol used for a desired property, 
initial round(s) of screening can sometimes be performed using bacterial cells due to high 
transfection efficiencies and ease of culture. However, especially for testing of immunogenic 

25 activity, test animals are used for library expression and screening. Similarly other ty^pes of 
screening which are not amenable to screening in bacterial or simple eukaryotic library cells, 
are performed in cells selected for use in an environment close to that of their intended use. 
Final rounds of screening can be performed in cells or organisms that are as close as possible 
to the precise cell type or organism of intended use. 

30 If further improvement in a property is desired, at least one, and usually a 

collection, of recombinant s egments surviving a first round of screening/selection are subject 
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to a: further round of recombination. These recombinant segments can be recombined with 
each other or with exogenous segments representing the original substrates or further 
variants thereof. Again., recombination can proceed in viiro or in vivo. If the previous 
screening step identifies desired recombinant segments as components 9f cells, the 

■ 5 cornponents can be subjected to further recombination in vivo, or can be subjected to funher - 
recombination in vitro, or can be isolated before performme a round of in vitro / 
recombination. Conversely, if the previous screening step identifies desired recombinant 
segments in naked form or as components of viruses, these'segments can be introduced into 
■ ^cells to perform a round of in vivo recombination. The second round of recombination, 

10 irrespective how performed, generates further recombinant segments which encompass 

additional diversity than is present in recombinant segments resulting from previous rounds. 

The second round of recombination can be followed by a further round of 
screening/'selection according to the principles discussed above for the first round. The 
stringency of screening-^selection can be increased between rounds. Also, the nature of the 

15 , screen and the propeny being'screened for can vary between rounds if im.provemem in more 
■ than one property is desired or if acquiring more than one new propeny is desired. 
Additional rounds of recombination and screening can then be performed until the 
• recombinant segments have sufficiently evolved to acquire the desired new or improved 
' property or function. . 

20 The practice of thiS: invention involves .the construction of recombinant 

nucleic acids and the expression of genes in transfected host cells. Molecular cloning 
techniques to achieve these ends are known in the an. A wide variety of cloning and in vuw 
amplification methods suitable for the construction of recombinant nucleic acids such as 
expression vectors are weU-knov.T) to persons of skill. General texts which describe 

25 molecular biological techniques useful herein, including mutagenesis, include Berger and , 
Kimmel, Guide to Molecular Cloning Techniques, Methods in Enzymology volume 152 
Academic Press, Inc., San Diego, CA (Berger); Sambrook ei qL; Molecular Cloning - A 
Laboratory Manual (2nd Ed,), Vol. 1-3, Cold Spnng Harbor Laboratory, Cold Spring 
Harbor, New York, 1989 ("Sambrook'*) and Current Protocols in Molecular Biology, F.M. 

30 Ausubel et ai, eds.. Current Protocols, a joint venture between Greene Publishing 

Associates, Inc. and John Wiley & Sons, Inc., (supplemented through 1998) ("Ausubel")). 
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Examples of techniques sufficient to direct persons of skill through in vitro amplification 
methods, mcluding.the polymerase chain reaction (PCR) the ligase chain reaction (LCR), Q - 
replicase amplification and other RNA polymerase mediated techniques {e.g., NASBA) are 
round in Berger, Sambrook, and Ausubel, as well as Mullis et ai (1987) U.S. Patent No. 

— ™5 ^4-6&3r2Q2^PGR-Protocols-A~Giude-to-AfethodS"an 

Press Inc. San Diego, CA (1990) (Innis); Amheim & Levinson (October 1, 1990) C&EN 
36-47; The Journal Of NIH Research (I991>3, SI -94; (Kwoh et al. (1989) Proc. Natl. Acad. 
Sci: USA 86, 1 173; Guatelli et al, (1990) Proc. NatL Acad. Sci. USA 87, 1874; Lomell i?/ ai 

(1989) y. Clin.Chem35, 1826; Landegren a/. (19&8) Science 24\, 1077-1080; Van Brunt 
10 (1990) Biotechnology^ 8, 291-294; \Vu and Wallace (1989) Gene 4, 560; Bamnger et al. 

(1990) Gene 89, 117, and Sooknanan and Malek (1995) Biotechnolog:/ 13: 563-564. 
Improved methods of cloning in vitro amplified nucleic acids are descnbed m Wallace et uv., 
U.S. Pat. No. 5,426,039. Improved methods of a^mplifying large nucleic acid.s by PCR are 
summanzed in Cheng et ai (1 994) Nature 369: 684-685 and the references 'therein, in which i; 

15 . PCR amplicons of up to 40kb are generated. One of skill will appreciate that essentially air/ 
RNA can be converted into a double stranded DN.A suitable for restriction digestion, PCR 
expansion and sequencing using reverse transcriptase and a polymerase. See, Ausubel, 
Sambrook and Berger, all supra. ^-n 
Oligonucleotides for use as probes, e.g:, in in vitro amplification methods, for • 

20 use as gene probes, or as shuffling targets {e.g., synthetic genes or gene segments) are 
typically synthesized chemically according to the solid phase phosphoramidite triester 
method described by Beaucage and Caruthers (1981) Tetrahedron letts.. 22(20): 1859- 1 862, 
e.g., using an^automated synthesizer, as described in Needham-VanDevanter et ai (1984) 
Nucleic Acids Res,, 12:6159-6168. Oligonucleotides can also be custom made and ordered 

25 from a variety of commercial sources known to persons of skill. 

Indeed, essentially any nucleic acid with a known sequence can be custom 
ordered from any of a variety of commercial sources, such as The iMidland Certified Reagent 
Company (mcrc@oligos.com), The Great American Gene Company 
(http://wvm.genco.com),' ExpressGen Inc. (www.expressgen.com), Operon Technoloigies 

30 Inc. (Alameda, CA) and many others. Similarly, peptides and antibodies can be custom 
ordered from any of a variety of sources, such as PeptidoGenic (pkim@ccnet.com), HTI 
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Biorproducts, Inc. (http:/Av\v\v.htibio.com). BMA Biomedicals Ltd (U.K.), Bio SvTithesis, 
Inc., and maiiy others. 

1. Purification and in vitro analysis of recombinant nucleic acids and 
polypeptides 

'5 Once DNA shuffling has been performed, the resulting library of recombinant 

polynucleotides can be subjected to purification and preliminary analysis //? vitro, in order to 
identify the most promising candidate recombinant nucleic acids. Advajitageously, the 
assays can be practiced in a high-throughput format. For example, to purify individual 
shuffled recombinant antigens, clones can robotically picked into 96-well formats, grown, 
10 and, if desired, frozen for storage. 

WTiole cell lysates (V-antigen), penplasmic extracts, or culture supematants ■ 
(toxins) can be assayed directly by ELISA as described below, but high throughput 
purification is sometim.es also needed. Affinity chromatography using immobilized, 
antibodies or incorporation of a small nonimmunogenic affinity tag such as a hexahistidine 
15 'peptide with immobilized metal affinity chromatography will allow rapid protein 

purification. High binding-capacity.reagents with'96-vvell filter bottom plates provide a high 
throughput purification process. The scale of culture and purification, vvill depend on protein 
yield, but initial studies will require less than 50 micrograms of protein. Antigens showing 
' improved properties can be purified in larger scale by FPLC for re-assay and animal 
20 challenge studies. 

In some embodiments, the shuffled antigen-encoding polynucleotides are . 
■ assayed as genetic vaccines. Genetic vaccine vectors containing the shuffled antigen ^ 
sequences can be prepared using robotic colony picking and subsequent robotic plasmid 
purification: Robotic plasmid purification protocols are available that allow purification of 
25 600-800 plasmids per day. The quantity and purity of the DNA can also be analyzed.in 96- 
well plates, for example. In a presently preferred embodiment, the amount of DNA in each 
sample is robotically normalized, which can significantly reduce the variation between 
different batches of vectors. 

Once the proteins and/or nucleic acids are picked and purified as desired, they 
30 can be subjected to any of a number oiin vitro analysis methods. Such screenings include, 
for example, phage display, flow cytometry, and ELISA assays to identify antigens that are 
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efficiently expressed and have multiple epitopes and a proper folding pattern. In the case of 
bacterial toxins, the libraries may also be screened for reduced toxicity in mammalian cells. 

As one example, to identify recombinant antigens that are cross-reactive, one 
can use a panel of monoclonal antibodies for screening. A humoral immune response. 
5 generally targets multiple regions of antigenic proteins. Accordingly, monoclonal antibodies"^ 
can be raised against vanous regions of immunogenic proteins (Alving et ai (1995) 
■ Immunol Rev. 145: 5). In addition, there are several examples of monoclonal antibodies that 
only recognize one strain of a given pathogen, and by definition, different serotypes of • 
pathogens are recognized by different sets of antibodies. For example, a panel of 
10 monoclonal antibodies have been raised against VEE envelope proteins, thus providing a 
means torecognize different subt>pes of the virus (Roehrig and Bolin (1997) 1 Clin. 
Microbiol, 35: 1887). Such antibodies, combined with phage display .and ELISA screening, 
can be used to enrich recombinant antigens that have epitopes, from multiple pathogen 
, strains. Flow cviometry based cell soning will further allow for the selection of variants ihat 
15 are most efficiently expressed. 

Phage display provides a powerful me^.hod for selecting proteins of interest ^' 
from large libraries (Bass et ai (1990) Proteins: Struct. Funct, Genet, S: 309; Lowman and 
Wells (1991) Methods: A Companion to Methods Enz, 3(3);205-216. Lowman and Wells 
(1993) J. Mol BioL 234;564-57S). Some recent reviews on the phage display technique *' 
20 include, for example, McGregor (1996) Mol BiotechnoL 6(2): 155-62; Dunn (1996) Curr 
Opin. BiotechnoL 7(5):547-53; Hill et ai (1996) Mol Microbiol 2d(4):685-92; Phage 
Display of Peptides and Proteins: A Laboratory Manual. BK,K^y, .1 Winter, J, McCafferty 
eds., Academic Press 1996; O'Neil et aL (1995) Curr Opin. Struct. BioL 5(4):443-9; 
Phizicky et ai (1995) Microbiol Rev. 59(1):94-123; Clackson et ai i\ 994) Trends 
25 BiotechnoL 12(5):173-84; Felici et aL (1995) BiotechnoL Annu. Rev. 1:149-83; Burton 

(1995) Jmmunotechnology l(2):87-94.) See^also, Cwirla et aL, Proc. NatL Acad. ScL USA 
87: 6378-6382 (1990); Devlin et aL, Science 249: 404-406 (1990), Scott & Smith, Science 
249: 386-388 (1990); Ladner et aL, US 5,571,698. Each phage particle displays a unique 
variant protein on its surface and packages the gene encoding ^that particular variant. The 
30 shuffled genes for the antigens are fused to a protein that is expressed on the phage surface, 
e.g., gene III of phage Ml 3, and cloned into phagemid vectors. In a presently preferred 
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embodiment, a suppressible stop codon {e.g., an amber stop codonj separates the genes so 
- that in a suppressing strain ofE. coli, the antigen-glllp fusion is produced and becomes 
incorporated into phage particles upon infection vyith Ml 3 helper phage. The same vector 
can direct,production of the unfused antigen alone in a nonsuppressing E. coli for protein 
5 purification. - / ' . . - 

The genetic packages most frequently used for display libraries are 
bacteriophage, panicularly filamentous phage, and especially phage M13, Fd and Fl. Most 
work has involved inserting libraries encoding polypeptides' to be displayed into either glll 
or gVIII of these phage forming a fusion protein. See, e.g., Dower,*WO 91/T9S18; Devlin, 

10 WO 91/189.89; MacCafferty, WO 92/01047 (gene III); Huse, WO 92/06204; Kang, WO 
92/18619 (gene VIII). Such a fusion protein comprises a signal sequence, usually but not 
necessarily, from the phage coat protein, a polypeptide to be displayed and either the gene III 
or gene VIII protein or a fragment thereof Exogenous coding sequences are often inserted 
at or near the N-terminus of gene III or gene- VIII although other'insertion sites are possible. 

15 Eukaryotic viruses can be used to display polypeptides in an analogous 

manner. For example, display of human heregulin fused , to gp70 of Moloney murine 
leukemia virus has been reported by Han et ai^ Proc, Natl. Acad, Set. USA 92: 9747-9751 
' (1995). Spores can also be used as repHcable genetic packages. In this case, polypeptides 
are displayed from the outer surface of the spore. For example, spores from.^. subtilis have 

20 been reported to be suitable. Sequences of coat proteins of these spores are provided by 

Donovan et aL, 1 MoL Biol. 196, I -10 (1987). Cells can also be used as replicable genetic 
packages. Polypeptides to be displayed are inserted into a gene encoding a cell protein that 
is expressed on the cells surface. Bacterial cells including 5a/morte//<3 typhimurium. Bacillus 
subtilis, Pseudomonas aeruginosa. Vibrio cholerae, Klebsiella pneumonia. Neisseria 

IS ' gonorrhoeae. Neisseria meningitidis, Bacteroides nodosus, Moraxella i?ov/.s, and especially 
Escherichia coli are preferred. Details of outer surface proteins are discussed by Ladner et - 
al, US 5,571,698.and references cited therein. For example, the lamB protein of E, coli is ; 
suitable. - 

A basic concept of display methods that use phage or other replicable genetic 

30 package is the establishment of a physical association between DNA encoding a polypeptide 
to be screened and the polypeptide. This physical association is provided by the replicable 
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genetic package, which displays a polypeptide as pan of a capsid enclosing the genome of 
the phage or other package, wherein the polypeptide is encoded by the genome. The 
establishment of a physical association beuveen polypeptides and their genetic material 
allows simultaneous mass screening of very large numbers of phage bearing different 

— 5— -pol-ypeptidesr Phage Qisplaying-a-pol\peptideJ^v:ith-affmity_to 

to the target and these phage are enriched by affmity screening to the target. The identity of . 
polypeptides displayed from these phage can be determined from their respective genomes. 
Using these methods a polypeptide identified as. having a binding affinity for a desired target 
can then be synthesized m bulk by conventional means, or the polynucleotide that encodes 
1 0 the peptide or polypeptide can be used as part of a genetic vaccine. 

Variants with specific binding propenies. in this case binding to famiiy- 
specific antibodies, are easily ennched by panning 'A'ith immobilized antibodies. Aniibodies 
specific for a single family are used in each round of panning to rapidly select variajViis that • 
have multiple epitopes from the antigen families. For example, A-family specillc antibodies ^ 
15 can be used to select those shuftled clones that display A-specific epitopes in the first round 
of panning. A second round of panning with B-speciilc antibodies will select irom the '*A" 
clones those that display both A- and B-specific epitopes. A third round of panning with C- 
specific antibodies will select for variants with A, B, and C epitopes. A continual selection , . 
exists dunng this process for clones that express well in E. coli and that are stable throughout .. 
20 the selection, improvements in factors such as transcnption, translation, .secretion, lolding 
and stability are often observed and will enhance the utility of selected clones for use in 
vaccine production. 

Phage ELISA methods can be used to rapidly characicrize individual viuiants. 
These assays provide a rapid method for quantitation of variants ^v^lhout requiring 

25 purification of each protein. Individual clones are arrayed into 96- well plates, grown, and 
frozen for storage. Cells in duplicate plates are infected with helper phage, grown overnight 
and pelleted by centnfugation. The supematants containing phage displaying particular 
variants are incubated with immobilized antibodies and bound clones are detected by anti- 
M13 antibody conjugates. Titration series of phage particles, immobilized antigen, and/or 

30 soluble antigen competition binding studies are all highly effective means to quantitate 
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pratein binding. Variant antigens displaying multiple epitopes will be. further studied in 
• appropriate animal challenge models. 

Several groups have reported an in vitro ribosome display system for the 
screening and selection of mutant proteins with desired properties from large libraries. This 
'5 technique can be iised similarly to phage display.to select or enrich for vanant antigens with 
improved propeilies such as broad cross reactivity to antibodies and improved folding {see, ■ 
e.g., Hanes et al: (1997) Proc.Nai 7. Acad. Sci. USA 94(l,0):4937-42; Mattheakis etal. '^ 
(1994) Proc. Nat 7. Acad. Sci. C/5.>1 91 (19): 902 2-6; He et al. (1997) NucL Acids Res. 
; .. 25(24):5 132-4; Nemoto eM/.( 1-997) 41 4(2):405 ' ■ 

10 ■ Other display methods e.xist to screen antigens'for improved properties such 

as increased expression levels, broad cross reactivity, enhanced foldmg and stability. These 
include', but are not hmi'ted to display orproteins on mtact £. coii or other ceils {e^g., 
' Francisco etat. {\993) Proc. Natl Acad. Sci. USA 90: 1044-1044'S; Lu efaf. (1995) . ■ 
: Bio/Technology- 13: 366-372). Fusions of shuffled amigens to DNA-binding proteins can ' 
15 link the antigen protein to. its gene in aii expression vector (Schatz ai. {\996'j Methods 
: ' ■ .£nzvmc?/; 267: 171-91; Gates a/. (1996) y. A/o/.5;o/..255: 373-86.) ... 
, : ■ ■ ' ■ The various display methods and ELISA assays can be used to screen for 
' shufned, antigens with improved properties such as presentation of multiple epitopes, . ' 
improved immunogenicity, increased expression levels, increased folding rates and ■ 
20 efficiency, increased stabilityto factors such as temperature, buffers, solvents:' improved ■ 

' purification properties, etc. Selection of shuffled antigens with improved expression, foidihg, 
stability and purification profile under a variety of chromatographic conditions can be ven,' 
important improvements to incorporate for the vaccine manufacturing process. 

To identify recombinant antigenic pol>T)eptides that exhibit improved, 

25 expression in a, host cell, flow cytometry is a.useful technique. Flow cytomeiry-proyides a . 
method to efficiently analyze the functional properties of millions of individual cells. One 
can analyze the expression levels of several genes simultaneously, and flow cytometry-based 
cell sorting allows for the selection of cells that display properly expressed antigen variants 
on the cell surface or in the cytoplasm. Very large numbers (>10') of cells can.be evaluated 

30 in a single vial experiment! and the pool of the best individual sequences can be recovered 
from the sorted cells. These methods are particularly usefiil in the case of. for example. 
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Hantaan virus glycoproteins, which are generally ver\' poorly expressed in mammalian cells. 
This approach provides a general solution to improve expression levels of pathogen antigens 
in mammalian cells, a phenomenon that is critical for-the function of genetic vaccines. 

To use flow cytometry to analyze polv'pepiides that are not expressed on the 
5 cell surface, one can engineer the recombinant polynucleotides inlfiFliSrary sucfthaTthe 
polynucleotide is expressed as a fusion protein that has a region of amino acids which is 
targeted to the cell membrane. For example, the region can encode a hydrophobic stretch of 
C-terminal amino acids which signals the attachment of a phosphoinositol-glycan (PIG) 
tenninus on the expressed protein and directs the protein to be expressed on the surface of 

10 the transfected cell (WTiitehom et aL (1995) Biotechnology (N Y) 1,3:1215-9). With an ' 
antigen that is naturally a soluble protein, this method will likely not affect the three 
dimensional folding of the protein m this engineered fusion with a new C-terminus. With an 
antigen that is naturally a iransmembrane protein (e.g,, a surface membrane protein on 
pathogenic viruses, bactena, protozoa or tumor cells) there are at !t:ast two possibilities.. . 

15 First, the. extracellular domain car. be engineered to be in fusion with the C-terminal 

sequence for signaling PIG-linkage. Second, the protein can be expressed in toto relying on 
the signalling of the host cell to direct it efficiently to the cell surface. In a minority of cases, 
the antigen for expression will have an endogenous PIG terminal linkage (e.g., some 
antigens of pathogenic protozoa). 

20 Those cells expressing the antigen can be identified with a fluorescent 

monoclonal antibody specitlc for the C-terminal sequence on PIG-linked forms of the 
surface antigen. FACS analysis allows quantitative assessment of ihe level of expression of 
the correct form of the antigen on the cell population. Cells expressing the maximal level of 
antigen are sorted and standard molecular biology methods are used to recover the piasmid 

25 DNA vaccine vector that conferred this reactivity. An alternative procedure that allows 

purification of all those ceils expressing the antigen (and that may be useful prior to loading 
onto a cell sorter since antigen expressing cells may be a very small minority population), is 
to rosette or pan-purify the cells expressing isurface antigen. Rosettes can be formed 
between antigen expressing cells and erythrocytes bearing covalently coupled antibody to 

30 the relevant antigen. These are readily purified by unit gravity sedimentation. Panning of the 
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cell population over petri dishes bearing inunobilized monoclonal antibody specific for the 
, relevant antigen can also ,be. used to remove unwanted cells. . . , 

. Inthe hi gh, throughput assays of the invention, it is possible to screen up to , 
several thousand different shuffled vanants in a single day. For. e.x'ample, each well of a 
microtiter plate can' be used to run a separate assay, or,, if concentration or incubation time- 
, effects are to be observed, ever>' 5-10 wellsxan test a single variant. Thus, a single standard 
microtiter plate can assay about 100 (e.^.;96) reactions. , If 1536 well plates are used, then a 
single plate can easily assay, from about 100 to about 1500 different reactions. It is possible 
to assay several different plates per'day; assay screens for up to about. 6,000-20,000 different 
assays ^'(/.^., involving different nucleic acids, encoded proteins, concentrations, etc.) is 
possible using the integrated systems of the invention. More recently,, microfluidic 
approaches to reagent manipulation have been developed, e,g,, by Caliper Technologies 
(Palo Alto, CA). ' ' ■ ■ ■ ; / , . * ■ . 

In one aspect, librars-' members, e-.^V, ceils, viral plaques, or the like, are 
separated' on solid media to produce individual colonies (or plaques). Using an automated 
colony picker {e.g., the Q-bot, Genetix, U.K.), colonies or plaques are identified, picked, and 
up to 10,00.0 different mutants- inoculated into' 96 well microtiter dishes, optionally . 
containing glass balls in the wells to prevent aggregation. The Q-bot does not pick an entire 
colony but rather inserts a pin . through the center of the colony and exits with a small 
sampHng of cells (or viruses in plaque applications). The time the pin is in the colony, the 
number of dips to inoculate the culture medium, and the time the pin is in that medium each 
effect inoculum size, and each can be controlled and optimized. The uniform process of the 
Q-bot decreases human handling error and increases the rate of establishing cultures, 
(roughly 10,000/4 hours). These cultures are then shaken in a temperature and humidity 
controlled incubator. The glass balls in^the microtiter plates act to promote uniform aeration 
of cells dispersal of cells, or the like, similar to the blades of a fermentor. Clones from 
cultures ;pf interest can be cloned by limiting dilution. Plaques or cells constituting Hbraries 
can also be screened directly for production of proteins, either by detecting hybridization, 
protein activity, protein binding to antibodies, or the like. 

The ability to detect a subtle increase in the performance of a shuffled library 
member over that of a parent strain relies on the sensitivity of the assay. The chance of 
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finding. the organisms having an improvement in ability to induce an immune response is 
increased by the number of mdividual mutants that can be screened by the assay. To increase 
the chances of identifying a pool of sufficient size/aprescreen that increases the number of 
mutants processed by 1 Orfold can be used. The goal of the prescreen will be to quickly 
5 identify mutants iiaving equal"oFbetter product fiters than the parent strain(s)'aiKl to move 
onfy these mutants forward to liquid cell culture for subsequent analysis. 

A number of well known robotic systems have also been developed for 
solution phase chemistries useful in assay systems. These systems include automated 
workstations like the automated synthesis apparatus developed by Takeda Chemical 
10 Industries, LTD. (Osaka, Japan) and many robotic systems utilizing robotic arms (Zymate II, 
Zymark Corporation. Hopkinton, Mass.; Orca, Hewiett-Packard, Palo Alto, Calif) whkh 
mimic the manual svnthetic operations performed by a scientist. .Any of the above devices 
are suitable for use with the present invention, e.g., for high-throughput screening of 
molecules encoded by codon-altered nucleic acids. The nature and implementation of 
15 modifications to these devices (if any) so that they ca^n operate as discussed herein with 
reference to the integrated system will be apparent to persons skilled in the relevant an. 

High throughput screening systems are commercially available (see, e.g., 
Zymark Corp.i Hopkinton, MA; Air Technical Industries, Mentor, OH; Beckman' 
Instruments, Inc. Fullerton, CA; Precision Systems, Inc., Natick, MA, etc). These systems 
20 typically automate entire procedures including all sample'and reagent pipetting, liquid 

dispensing, timed incubations; and final readings of the microplate in detector(s) appropriate 
, ..for the assay. . These configurable systems proyide high throughput and rapid start up as well 
as a high degree of flexibility and customization. 

The manufacturers of such systems provide detailed protocols the various 
25 high throughput. Thus, for example, Zymark Corp. provides technical bulletins describing 
screening systems for detecting the modulation of gene transcription, ligand binding, and the 
like. Microfluidic approaches to reagent manipulation have also been developed, e.g., by 
Caliper Technologies (Palo Alto, CA). 

Optical images viewed (and, optionally, recorded) by a camera or other 
30 recording device (e.g., a photodiode and data storage device) are optionally further processed 
in any of the embodiments herein, e.g., by digitizing the image and/or storing and analyzing 
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; the image on a computer. As noted above, in some applications, the signals resulting from 

assays are florescent, making optical detection approaches appropriate in these instances. A- 
' variety ofcornmercialiy available peripheral equipment and software is ava^ 

digitizing, storing and analyzing a digitized video or digitized optical image, e.g., using PC 
5 (Intel x86 or Pentium chip- compatible DOS , 0S2 WINDOWS , WINDOWS NT or 

WrNDOWS95 based machines), MACINTOSH , or L^TX based (e.^., SUN work station) 

computers. 

One. conventional system carries light from the assay device to a cooled 
charge-coupled device (CCD) camera, in common use in the art. A CCD camera includes an 

10 array of picture elements (pixels). The light from the specimen is imaged on the CCD. 
Particular pixels corresponding to regions of the specimen (e.g.. individual hybridization 
sites on an array of biological polviners) are sampled to obtain light intensity readings for 
each position. Multiple pixels are processed in parallel -:o increase speed. The apparatus and 
methods of the mvention are easily used for viewing any sample, e.g., by tluorescem or dark 

15 field microscopic techniques. 

Integrated systems for analysis in the present invention typically include a 
digital computer with high-throughput liquid control software, image analysis software, data 
interpretation software, a robotic liquid control armature for transferring solutions from a ' * 
source to a destination operably linked to the digital computer, an input device (e.g., a. 

20 ' computer keyboard) for.entering data to the digital computer to control high throughput 
liquid transfer by the robotic liquid control armature and, optionally, an image scanner for 
digitizing label signals from labeled assay component. The image scanner interfaces with 
the image analysis software to provide a measurement of optical intensity. Typically, the 
intensity measurement is interpreted by the data interpretation software to show whether the 

25 optimized recombinant antigenic polypeptide products are produced. 

2, Antige n Library Immunization 

In a presently preferred embodiment, antigen library immunization (ALI) is 
used to identify optimized recombinant antigens that have improved immunogenicity. ALI 
involves introduction of the library of recombinant antigen-encoding nucleic acids, or the 
30 recombinant antigens encoded by the shuffled nucleic acids, into a test animal. The animals 
are then subjected to in vivo challenge using live pathogens. Neutralizing antibodies and 
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cross-protective immune responses are studied after immunization with the entire libraries, 
pools and/or individual antigen Variants. • . . 

Methods of immunizing test animals are -well known to those of skill in the- 
art. In presently preferred, embodiments, test animals are immunized, twice or three times at 
5-— two-week"intervalsr0ne-week'aflertheiast"irnmuriization7the~ari^^ 

li ve pathogens (or mixtures of pathogens), and the survival and s>Tnptoms of the animals is 
followed. Immunizations using test animal challenge are described in, for example, 
Roggenkamp et al, (1997) Infect. Immun. 65: 446;AVoody et al. (1997) Vaccine 2: 133; 
Agren et.al. (1997) y. Immunol. 158: 3936; Konishi et al. (1992) Virology 190: 454; Kinney 
10 e/ aL (1988) /. ViroL 62: 4697; lacono-Connors el al (1996) Virus Res. 43: 125; Kochel ei 
ai:{l997) Vaccinel5:547',andChuetaL(\995)J. ViroL '69:'64l7,. 

The immunizations can be perforrried by injecting either the recombinant 
pql>Tiucleotides themselves, i.e.. as a genetic vaccine, or by immunizing the animals wiih 
' polypeptides encoded by the recombinant polynucleotides.. Bacterial antigens are typicanv 
15 screened. primarily as recom.binant proteins, whereas viral antigens are preferablv analyzed ■ 
using genetic vaccinations. - • ^ 

To dramatically reduce the number of experiments required to identify 
individual antigens having'improved immunogenic properties, one can use pooling and 
deconvolution, as diagranimed in Figure 6. Pools of recombinant nucleic acids, or 
20 polypeptides encoded by the recombinant nucleic acids, are used to immunize test animals. 
Those pools that result in protection against pathogen challenge are then subdivided and 
* subjected to additional analysis. The high througihput in virro approaches described above 
can be used to identify the best candidate sequences for the in vivo studies. 

, . The challenge models that can be used to screen for protective anigeus 
25 include pathogen and toxin models, such. as Yersinia bacteria, bacterial toxins (such as - 

Staphylococcal and Streptococcal enterdtoxins, Exoli/V. cholerae enterotoxins),- Venezuelan 
equine encephalitis virus (VEE)/Flaviviruses (Japanese encephalitis virus, Tick-bome 
encephalitis virus, Dengue virus), Hantaan virus, Herpes simplex, influenza virus (e.g., 
Influenza A virus), Vesicular "SiiomzixXxs V'xmz, Pseudomonas aeritginosa. Salmonella 
30 ryphimurium, Escherichia coli, Klebsiella pneumoniae. Toxoplasma gondii, Plasmodium 
yoelii,. Herpes simplex, influenza virus (e.g., Influenza A virus), and Vesicular Stomatitis 
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. Vims. However, the test animals can also be challenged with tumor cells to enable 
screening of antigens that efficiently protect against malignancies. Individual shuffled 
antigens or pools of antigens are introduced into the animals intradermally, intramuscularly, 
intravenously, intratracheally,' anally, vaginally, orally, or intraperitoneally and antigens that 
5 can prevent the disease are chosen, when desired, for further rounds of shuffling and 
selection. Eventually, the most potent antigens, based on in vivo data in test animals and 
comparative in vitro studies in animals and man, are chosen for human trials,, and their 
capacity tO'prevent and treat human diseases is investigated. 

In some embodiments, antigen library immunization and pooling of 

1 0 individual clones is used to immiinize against a pathogen strain that was not included in the 
sequences that were used to generate the library. The level of crossprotection provided by ' . ' 
different strains of a given.pathogen can significantiy. However, homologous titer is always ■ 
higher than heterologous titer. Pooling and deconvoiution is espccialiy efficient in models 
where minima! protection is provided by the wild-type antigens used as starting material for 

15 .shuffling (for example mmimal protection by antigens A and B against strain C in Figure - 
3B). This approach can be taken, for example, when evolving the V-antigen-of Yersinae or 
Hantaan virus glycoproteins. , 

In some embodiments, the desired screening involves analysis of the imrnune 
response based on immunological assays knowm to those skilled in the art. Typically, the 

20 test animals are first immunized and blood or tissue samples are collected for example one to 
two weeks after the last immunization. These studies enable one to one can measure 
immune parameters that correlate to protective immunity, such as induction of specific 
antibodies (particularly IgG) and induction of specific T lymphocyte responses, in addition 
to determining whether an antigen or pools of antigens provides protective immunity. 

25- Spleen cells or peiipherai blood mononuclear cells can be isolated from immunized test. 

animals and measured for the presence of antigen-specific T cells and induction of cytokine 
synthesis. ELIS A, ELISPOT and cytoplasmic cytokine staining, combined with flow 
cytometry, can provide such information on a single-cell level. 

Common immunological tests that can be used to identify the efficacy of 

30 immunization include antibody measurements, neutralization assays and analysis of . 

activation levels or frequencies of antigen presenting cells or lymphocytes that are specific 
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for the antigen or pathogen. The test animals that can be used in such studies include, but 
are not limited \o\ mice, rats, guinea pigs, hamsters, rabbits, cats, dogs, pigs and monkeys.. 
Monkey is a particularly useful test animal because the MHC molecules of monkeys and 
humans are ver\' similar. ' . ' ■ . . 

"^5^™ Virus neutralization assays are usefuFfordetection of antibodies thai not only 

specirlcaily bind to the pathogen, but also neutralize the function of the virus. These assays 
■ are typically based on detection of antibodies in the sera of immunized animal and analysis ■ 
of these antibodies' for their capacity to inhibit viral grovrlh-in tissue culture cells. Such 
assays are known to those skilled in the art. One example of a virus neutralization assay is 
10 described by Dolin R (J. Infect. Dis: 1995, 172:1 175-83): Virus neutralization assays 
provide means to screen for antigens that also provide protective immunity. - 

In some embodiments, shuffled antigens are screened for their capacity to 
induce T cell activation in vivo. More specifically, peripheral blood mononuclear cells or 
spleen cells from injected mice can be isolated and the capacity of cytotoxic T lymphocytes • 
15 to lyse infected, autologous target cells is studied. Tne spleen cells can be reactivated with 
the specific antigen v//ro. in addition,. T helper cell activation and differentiation is 
. analyzed by measuring cell proliferation or production of Th I (IL-2 andTFN-y) and Th2 (IL- • 
4 and IL-5) cytokines by ELISA and directly in CD4" T cells by cytoplasmic cytokine '''' 
staining and flow cytomet^>^ Based on the cytokine production profile, one.can also screen ' 
20 for alterations in the capacity of the antigens to direct Th1/Th2 differentiation (as evidenced, 
for example, by changes in ratios of IL-4/[FN-y, IL-4/1L-2, IL-S/TFN-y, IL-5,TL-2, IL- ' 
J JL-p/IL-Z). The analysis of the^T celUcjivaU by.the antigen:..Yanants.;. . . . 

is a very useful screening method, because potent acdvation of specific T cells in vivo 
correlates to induction:of protective immunity. 
25 The frequency of 2btigen-specificCD8"T cells />i v/vo can also be directly 

analyzed using tetramers of MHC class I molecules expressing specific peptides derived 
from the corresponding pathogen antigens (Ogg and McMichael, Curr Opin, Immunol 
1998, 10:393-6; Altman et aL, Science 1996, 274:94-6). The bindingof the tetramers can 
be detected using flow cytometry, and will provide information about the efficacy of the 
30 shuffled antigens to induce acdvation of specific T cells. For example, flow cytometry and 
tetramer stainings provide an efficient method of identifying T cells that are specific to a 
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given antigen or peptide. .Another method involves panning using plates coated with 
tetramers with the speci fic peptides. This method allows large numbers of cells to te 
handled in a short time, but the method only selects. for highest expression levels. The 
higher the^ frequency of antigen-specitlc T cells in vivo is, the more efficient the 
5' immunization has been, enabling identification of the antigen variants that have the most 
potent capacity to induce protective immune responses. These studies are particularly useful 
when conducted in monkeys, or other primates, because the MHC class I molecules of . 
humans mimic those of other primates more closely than those of mice.; 

■ Measurement of the activation of antigen presenting cells (APC) in response ' 

10 to immunization by antigen variants is another useful screening method. Induction of APC 
activation can be detected based on changes in surface expression levels of activauon 
'antigens, such as B7-1 (CD80), B7-2 (CD86), MHC class I and.IL CD14, CD23, and Fc 
receptors, and the like. 

Shuffled cancer antigens that induce cy^totoxic T cells that have the capacity 

15 to kill cancer cells can be identified by measuring the capacity of T cells derived /rom , 

immunized animals to kill cancer cells in vitro. Typically the canc.er cells are firstiabeied 
with radioactive isotopes and the release of radioactivity is an indication of tumor cell killing 
after incubation in the presence of T cells from immunized anir.ials. Such-cytotoxicity 
assays are knowTi in the art. 

20, ' An indication of the efficacy of ah antigen to activate T cells specific for, for 

example, cancer antigens, allergens or autoanrigens, is also the degree of skin inflammation 
when the antigen is injected into the skin of a patient or test animal; Strong inflammation is 
correlated with strong activation of antigen-specific T cells. Improved activation of tumor- 
specific T cells may lead to enhanced killing of the tumors. In case of autoantigens, one can 

25 add immunomodulators that skew the responses tovvards Th2; whereas in the case of 

allergens a ThI response is desired. Skin biopsies can be taken, enabling detailed studies of 
the type of immune response that occurs at the.sites of each injection (in mice and monkeys 
large numbers of injections/antigens can be analyzed). Such studies include detection of 
changes in expression of cytokines, chemokines, accessory molecules, and the like, by cells 

30 upon injection of the antigen mto the skin. 
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To screen for antigens that have optimal capacity to activate antigen-specific 
T cells, peripheral blood mononuclear cells from previously infected or immunized humans 
individuals can be used. This is a particularly useful method, because the MHC molecules 
that will present the antigenic peptides are human MHC molecules. Peripheral blood 
5 mononuclear cells or purified professional antigen-presenting cells (APCs) can be isolated 
from previously vaccinated or infected individuals or from patients with acute infection with 
the pathogen of interest. Because these individuals have increased frequencies of pathogen- 
specific T cells in circulation, antigens expressed in PBMCs or purified APCs of these 
individuals will induce proliferation and cytokine production by antigen-specific CD4"*^ and 
10 CDS"^ T cells. Thus, antigens that simultaneously harbor epitopes from several antigens can 
be recognized by their capacity to stimulate T cells from various patients infected or 
immunized with different pathogen antigens.. cancer antigens, .iutoantigens or ailergens. One 
buffy coat derived from a blood donor contains lymphocytes from 0.5 liters of biood, and up 
tolO'^PBMC can be obtained, enabling ver\' large screening experiments using T cei ls from 
15 one donor. 

When healthy vaccinated individuals (lab volunteers) are studied, one can 
make EB V-transformed B cell lines from these individuals. These cell lines can be used as 
antigen presenting cells in subsequent experiments using blood from the same donor; this 
reduces interassay and donor-to-donor variation. In addition, one can make antigen-specific 

20 T cell clones, after which antigen variants are introduced to EBV nransformed B cells. The 
efficiency with which the transformed B cells induce prohferauon of the specific T eel), 
clones is then studied. When working with specific T cell clones, the proliferation ajnd 
cytokine synthesis responses are significantly higher than when using total PBMCs, because 
the frequency of antigen-specific T cells among PBMC is very low. 

25 CTL epitopes can be presented by most cells types since the class i major 

histocompatibility complex (MHC) surface glycoproteins are widely expressed. Therefore, 
transfection of cells in culture by libraries of shuffled antigen sequences in appropriate 
expression vectors can lead to class I epitope presentation. If specific CTLs directed to a 
given epitope have been isolated from an individual, then the co-culture of the iransfected 

30 presenting cells and the CTLs can lead to release by the CTLs of cytokines, such as IL-2, 

DFN- y, or TNT, if the epitope is presented. Higher amounts of released TNF will correspond 
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to more efficient processing and presentation of the class I epitope from the shuffled, 
evolved sequence. Shuffled antigens that induce cytotoxic T cells that have the capacity to 
kill infected cells can also be identified by measuring the capacity of T cells derived from 
■ immunized animals to kill infected cells in vitro. T>T>ically the target cells are first labeled 
with radioactive isotopes and the release.of radioactivity is an indication of target cell killing 
.after incubation in the presence of T cells from immunized animals.' Such cytotoxicity 
assays are known in the art. 

A second method for identifying optimized CTL epitopes does not require the 
isolation of CTLs reacting with the epitope. In this approach, cells expressing class I MHC 
surface glycoproteins are transfected with the library of evolved sequences as above. After 
suitable incubation to allow for processing and presentation, a detergent soluble extract is 
prepared from' each cell culture and after a partial purification of the MHC-epitope complex ■ 
(perhaps optional) the products are submitted to mass speciromeir/ (Henderson et al. (1993) 
Proc. Nat l. Acad. ScL USA 90: 10275-10279). Since the sequence is known of the epitope 
whose presentation to be increased, one can calibrate the mass spectrogram to identify this 
peptide. .In addition, a cellular protein can be used for intemalxalibration to obtain a ■ 
quantitative result;, the cellular protein used for internal calibration could be the MHC 
molecule itself Thus one. can measure the amount of peptide epitope bound as a proportion 
of the MHC molecules. 

Use of Recombinant Multivalent Antigens 

The multivalent antigens of the invention are useful for treating and/or 
preventing the various diseases and conditions with which the respective antigens are 
associated. For example, the multivalent antigens can be expressed in a suitable host cell and 
are administered in polypeptide form. Suitable formulations and dosage regimes for vaccine 
delivery are well known to those of skill in the art. • , 

In presently preferred embodiments, the optimized recombinant 
polynucleotides that encode improved allergens are used in conjunction with a genetic 
vaccine vector. The choice of vector and components can also be optimized for the particular 
purpose of treating allergy. For example, the polynucleotide that encodes the recombinant 
antigenic polypeptide can be placed under the control of a promoter, e.g., a high activity or - 
tissue-specific promoter. The promoter used to express the antigenic polypeptide can itself 
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be optimized using recombination and selection methods analogous to those described 
herein. The vector can contain immunostimulatorv' sequences such as are described in 

copending, commonly assigned US Patent Application Serial No. , entitled 

''Optimization of Immunomodulatory Molecules," filed as TTC Attorney Docket No. 18097- 
■^030300US"on~Febru'ary1"0rr999~Av*ec^ 
for many of the immune responses mediated by the antigens described herein {see, e.g., 

copending, commonly assigned US Patent Application Serial No. , entitled 

"Genetic Vaccine Vector Engineering/' filed on February 10, 1999 as TTC Attorney Docket 
No. 1 8097-0301 OOUS): It is sometimes advantageous to employ a genetic vaccine that is 
targeted for a particular target cell type (e.g. , an antigen presenting cell or an antigen 
processing cell); suitable targeting methods are described in copending, commonly assigned 

US patent application Serial No. , entitled 'Targeting of Genetic Vaccine 

Vectors/' filed on February 10, 1999 as TTC Attorney Docket No. 1 S097.030200US. 

Genetic vaccines that encode the multivalent antigens described herein can be 
delivered to a mammal (including humans) to induce a therapeutic or prophylactic immune 
response. Vaccine delivery vehicles can be delivered in vivo by administration to an 
individual patient, t>i3ically by systemic administration (e.g., intravenous, intraperitoneal, 
intramuscular, subdermal, intracranial, anal, vaginal, oral, buccal route or they can be 
inhaled) or they can be administered by topical application. Alternatively, vectors can be 
delivered to cells ex vivo, such as cells explanted from an individual patient {e.g., 
Ijmiphocytes, bone marrow aspirates, tissue biopsy) or universal donor hematopoietic stem 
cells, followed by reimplantation of the cells into a patient, usually after selection for ceils 
which have incorporated the vector. 

A large number of delivery methods are well known to those of skill in the 
art. Such methods include, for example liposome-based gene delivery (Debs and Zhu (1993) 
WO 93/24640; Mannino and Gould-Fogerite (1988) BioTechniques 6(7): 682-691; Rose 
U.S. Pat No. 5,279,833; Brigham (1991) WO 91/06309; and Feigner et aL {mi) Proc. Natl. 
Acad. Sci, USA 84: 7413-7414), as well as use of viral vectors (e.g., adenoviral {see. e.g., 
Btms etaL {1995) Ann., NY Acad. Sci. 772: 95-104; Ali et al. (1994) Gene Ther. 1: 367-384; 
and Haddada et al. (1995) Curr. Top. Microbiol Immunol. 199 ( Pt 3): 297-306 for review), 
papillomaviral, retroviral {see, e.g.. Buchscher et al. (1992) J. Virol. 66(5) 2731-2739; 
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.. Jqhann al. (1992) J. VirdL 66 (5): 1635-1640 (1992);,Sommerfelt'e/ ai, {1990) ViroL 
176:58-59; Wilson et al. (1989) J. Virol. 63:2374-2378; Miller et al., J.. Virol. 65:2220-2224' 
(1991); Wong-Staal e/ a/., PCT/US94/05700, and Rosenburg and Fauci (1993) in 
Fundamental immunology, TJiird Edition Paul (ed) Raven Press, Ltd., .New York and the 
5 refe'rences therein, and Yu et ql:; Gene Therapy (1994) supra.),' said adeno-associated viral • 
... vectors (see. West et.al: (1987) Virology 160:38-47; Garter e/ (■1989) U.S. Patent No. \ 
4,797,368; Carter e-r al. WO 93/24641 (1993); Kotin (1994) Human Gene Therapy . 5:793- 
801; Muzyczka.(1994).y. Clin. Invst. 94: 1351 and Samulski (jupra) for an overview of A.W 
vectors; see also, Lebkowski, U.S. Pat. No. 5,173,414; Tratschin et al. (1985) Mol, Cell 
10 Biol. 5(ll):3251-3260; Tratschin, et ai .(1984) Mol. CelL BioL, 4:2072-2081; Hermonat, and ■ 
Muzyczka (1984) Proc. Natl: Acad, Sci. USA, 81:6466-6470; McLaughlin et al, (1988) and 
Samulski a/. (1989)7. Wro/-, 63:03822-3828)^ ■ ■ . ■ 

■ " ■ ' . ■ . ''Naked" DNA and'br RNA that compnses a genetic vaccine can be 
' 'introduced directly into a tissue; such as muscle. See, e.g., USPN 5;5S0,859. Other methods 
15 such as *'biolistic" or panicle-mediated transformation {see, e.g., Sanford et a/., USPN , 

4,945,050; USPN 5,036,006) are also suitable for introduction of genetic vaccihes'into cells 
of a mammal according to the invention. These methods are useful not only fox in vivo > ' 
introduction.of DNA into 'a mammal, but also for ex vivo modification of cells for 
reintroduction into a mammal. As for other methods of dehvenng genetic vaccines, if 
20 ■ necessary, vaccine 'administration is repeated in order to maintain the desired level of : ' 
inununornodulation. . ' • . 

Genetic vaccine vectors (e.g., adenoviruses, liposomes, papillomaviruses,. 
' retroviruses, etc.) can be administered directly to the mammal for transduction of cells i>7 
vivo. The genetic vaccines obtained using the methods of the invention can be formulated as 
25 phannaceutical compositions for administ^^^ in any suitable maimer, including parente^^ 
(e.g., subcutanepusv intramuscular, intradermal, or intravenous), topical, oral, rectal, 
intrathecal, buccar(e.g„ sublingual), or local administration, such as by aerosol or, 
transdermally, for prophylactic and/or therapeutic treatment. Pretreatment of skin, for 
example, by use of hair-removing agents, may be useful in transdermal delivery. Suitable 
30 methods of administering such packaged nucleic acids are available and well known to those 
of skill in the art, and, although more than one route can be used to administer a particular 
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composition, a panicular route can often provide a more immediate and more effective 
reaction, than another route, _ ■ * V 

' . ■ ' ' Pharmaceutically acceptable carriers afe determined in pan, by the panicular 
composition- being administered, as well as by che particular method used to administer the 
™5 ■ composition. AccordiriglyTthere is a wide variety of suita&ie^formulations of pharmaceutical 
compositions of the present invention. A variety of aqueous carriers can be used, ^.'g-./ 
buffered saline and the like. These solutions are sterile and generally free of undesirable 
matter. These compositions may be sterilized by conventional, well known sterilization 
techniques; the compositions may contain pharrhaceuticaliy acceptable auxiliary substances 
10 as, required to approximate physiological conditions such as pH adjusting and buffering- 
agents, toxicity adjusting agents and the like, for example, sodium acetate, sodium, chloride, 
potassium chloride, calcium chloride,, sodium lactate and the like.- The concentrarion of 
genetic vaccine vector in these formulations*can van/ widely, and vvill be selected primarily 
based on fluid volumes, v'iscosities, body weight and the like in accordance with the 
15 .particular mode of administration-selected and the patient's needs. ' 

Fonriulations suitable for oral administration can consist of (a) liquid 
. solutions, such as an effective amount of the packaged nucleic acid suspended in diluents, 
such as water, saline or PEG'400; (b) capsules, sachets or tablets, each containing a 
predeterinined amount of the active ingredient, as liquids, solids, granules or gelatin; (c) 
20 suspensions in' an appropriate liquid; and (d) suitab^le emulsions. Tablet forms can include ' 
one or more "of lactose, sucrose, manriitol, sorbitol, calcium phosphates, com starch, potato 

: . ...starch, tragacanth, microcr>:stalline cellulose, acacja,:gejatin, colloidaLsihc^ 

croscarmellose sodium, talc, magnesium- stearate, stearic acid, and other excipients, * 
colorants, fillers, binders, diluents, buffering agents, moisteriing agents, preservatives, 
25 flavoring agents, dyes, disiriteCTating agents, and phannaceutiQally com^ 

Lozenge forms can comprise the active ingredient in a flavor, usually sucrose and- 'acacia or 
. tragacanth, as well as pastilles comprising the active ingredient in an inert base, such as 
gelatin and glycerin or sucrose and acacia emulsions, gels; and the like containing, in 
addition to the active ingreidient, carriers knowri in the art. It , is recognized that the genetic 
30 vaccines, when administered orally, must be protected from digestion. This is typically 
accomplished either by complexing the vaccine vector with a composition to render it 
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' resistant to acidic and enzymatic hydrolysis or by packaging the vector. in an appropriately ' , 

resistant carrier such as a liposome. Means of protecting vectors .from digestion are well 
loiown in the an. The pharmaceutical composuions can be encapsulated, e.g., in liposomes, ' ; 

or in a formulation that provides for slow release of the acti ve ingredient. . - - 

5 . The packaged nucleic acids, alone or in combination with other suitable , ■ ■ ^ 

components, can be rnade into aerosol formulations {e.g.. they can be ''nebulized") to be 
administered via inhalation. Aerosol formulations can be placed into pressurized acceptable 
propeliants, such as dichlorodifluoromethane,, propane, nitrogen, and the like. 

Suitable formulations for rectal administration include, for example, ' ' , 

10 suppositories,\vhich consist of the packaged nucleic acid with a suppository base. Suitable ; , 

^suppositorj^ bases include natural or s>Tithetic triglycerides or paraffin hydr^ In ■ . 

addition, It is also possible to use gelatin rectal' capsules which consist of a combination of 
the packaged nucleic acid with a base, including, for example, liquid triglycerides, ^ 

\ polyethylene glycols, and paraffin hydrocarbons. . ' • ' ^ ' , ' ; • 

■ 15 Formulations, suitable for parenteral administration, such'as, for example, by ■ 

intraarticular (m .the joints), intravenous, intramuscular, intradennal, intra 
subcutaneous routes, include aqueous and non-aqueous, isotonic sterile injection solutions, ' 
. which can.contain antioxidants, buffers, bacteriostats, and solutes that render the formulation ' • \ 

isotonic vvith.the blood of the intended recipient, and aqueous:and non-aqueous stenle / ' ■ , ' ' ' j 

20 ■ suspensions that can' include suspending agents, solubiUzers, thickening agem^^^ - • | 

and preservatives. In the practice of this invention,, compositions can be administered, for j 
example, by intravenous infusion, orally, topically, intraperitoneally, intravesically or \ 
intrathecally. Parenteral administration and intravenous administration a^^ 
methods of administration. The formulations of packaged nucleic acid can be presente 

25 unit-dose or multi-4ose sealed containers; such as am ' 

Injection solutions and suspensions can be prepared from sterile powders, I 
granules, and tablets ofthe kind previously described. Cells transduced by the packaged - I 

nucleic acid can also be administered intravenously or parenterally. ] 

The dose administered to a patient, in the context of the present invention '\ 

30 should be sufficient to effect a beneficial therapeutic response in the patient over time. The ! 
dose will be determined by the efficacy of the particular vector employed and the condition 
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of the patient, as well as the body weight or vascular surface area of the patient to be treated. 
The size of the dose also will be determined by the existence, nature, and extent of any 
adverse side-effects that accompany the administration of a particular vector, or transduced 
cell type in a particular patient, 

In~detennining'thTe"ffectiv^ 
treatment or prophylaxis of an infection or other condition, the physician evaluates vector 
toxicities, progression of the disease, and the production of anti-vector antibodies, if any. In 
general, the dose equivalent of a naked nucleic acid from a vector is from about 1 )ig to 1 mg 
for a typical 70 kilogram patient, and doses of vectors used to deliver the nucleic acid are 
calculated to yield an equivalent amount of therapeutic nucleic acid. Administration can be 
accomplished via single or divided doses. 

In therapeutic applications, compositions are administered lo a patient 
suffering from a disease (e.g., an infectious disease or autoimmune disorder) in an amount 
sufficient to cure or at least panially arrest the disease and its complications. An amount 
15 adequate to accomplish this is denned as a "therapeuucally effective dose.'^ .\mounts 

effective for this use will depend upon the severity of the disease and the general state of the 
patient's health. Single or multiple administrations o: the compositions may be administered 
depending on the dosage and frequency as required and tolerated by the patient. In any 
event, the composition should provide a sufficient quantity of the proteins of this invention 
20 to effectively treat the patient. 

In prophylactic applications, compositions are administered to a human or 
other mammal to induce an immune response that can help protect against the establishment 
of an infectious disease or other condition. 

The toxicity and therapeutic efficacy of the genetic vaccine vectors provided 
25, by the invention are detemiined using standard pharmaceutical procedures in cell cultures or 
experimental animals. One can determine the LD50 (the dose lethal to 50% of the 
population) and the ED50 (the dose therapeutically effective in 50% of the population) using 
procedures presented herein and those otherwise known to those of skill in the art. 

A typical pharmaceutical composition for intravenous administration would 
30 be about 0.1 to 10 mg per patient per day. Dosages from 0.1 up to about 100 mg per patient 
per day may be used, particularly when the drug is administered to a secluded site and not 
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_ , into the blood stream, such as into a body cavity or into a , lumen of an organ. Substantially 
higher dosages are possible .in topical administration. Actual methods for preparing 
parenterally administrable compositions will be known or apparent to those skilled in the art 
and are described in more detail in such publications as Remington's Pharmaceutical 
Science, 15th ed., Mack Publishing Company, Easton. Pennsylvania (1980). 

The multi valent antigenic polypeptides of the invention, and genetic vaccines 
that express the polypeptides, can' be packaged in packs, dispenser devices, and kits for ' 
administering genetic vaccines to a mammal. For example, packs or dispenser devices that 
, contain^one or more unit dosage forms are provided,. Typically, instructions for 
administration of the compounds will be provided with the packaging, along with a suitable 
indication on the label that the compound is suitable for treatment of an indicated condition. 
For example, the label may state that the active compound within the packaging is useful for " 
treating a particular infectious disease, autoimmune disorder, rumor, or for preventing or 
treating other diseases or conditions. that are mediated by, or potentially susceptible to, a 
mammalian immune response. 

EXAMPLES 

. The following examples are offered to illustrate, but not to limit the present 

invention. 

Example 1 

Development Of Broad-Spectrum Vaccines Against Bacterial Pathogens And Toxins 

A. Evolution of Yersinia V-antigens 

This Example describes the use of DN A shuffling to develop immunogens 
that produce strong cross-protective immune responses against a variety of Yersinia strains. 
Passive immunization with anti-V-antigen antibodies or active vmrnunization with purified 
V-antigen can provide protection j&om challenge with a virulent autologous Yersinia species. 
However, protection against heterologous species is limited (Motin et al. (1 994) Infect. 
Immun. 62: 4192). 

V-antigen genes from a variety of Yersinia strains, including serotypes of Y. 
pestis, Y, enterocolitica, and Y, pseudotuberculosis are subjected to DNA shuffling as 
described herem. The Yersinia pestis V antigen coding sequence, for example, is used as a 
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query in a database search to identify homologous genes that can be used in a' family ' 
shuffling format to obtain improved antigens. Results for a BLAST search of GenBank and 
EMBL databases are shown in Table 1, in which each line represents a unique sequence 
entry listing the database, accession number, iocus name, bit score and E value. See, Altschul 
"5 erar{X99iyNucleic^ia^drR^^ ^ 
Homologous antigens have been cloned and sequenced from a number of related yet distinct 
Yersinia strains and additional natural diversity is obtained by cloning antigen genes from 
other strains. These genes and others or fragments thereof are cloned by methods such as 
PGR, shuffled and screened for improved antigens. 



10 

Table 1 



j Sequences producing significant alignments 

t ■ ■ ■ ^ ' , - 


' i ■ ' 
\ 

! ■ 


Database/Accession No. j Gene t Score 

^ Ifbits) 


E 

Value i 


gbjM26405|YEPLCR j Yersinia pestis IcrG, icrV\ and IcrH genes, 

i CO 


1 1945 


j 0.0 


gb|AF053946|.AF053946 1 Yersinia pestis plasm.id pCDl, complete pia 

i 


1945 


0.0 


! embjX96802|YPTPIVANT | pseudotuberculosis V antigen gene 

j ■ ■ 


1834 


.0.0 


gb|M57893|YHPLCRGVHP 


Yersinia pseudotuberculosis V-antigen 


1818 


0.0 


gb|AF080155|AFOS0155 


Yersinia enterocolitica pYV LcrV (IcrV) 
, antigen . 


1723 


0.0 


emb|X9680 1 |YE96P VANT 


_Y, enterocolitica V antigen gene, sirain Y.-:... 


-1667 


.0.0 


emb|X96799|YE108VANT 


Y. enterocolitica V antigen gene, strain Y-... 


1659 


0.0 


emb|X968001YE527VANT 


Y . enterocolitica V antigen gene, serotype ... 


1651 


0.0 


emb|X96798|YE808VANT 


. y. enterocolitica V antigen gene, strain 
8081 


1643 


0.0 


emb|X96796|YE314VANT 


Y. enterocolitica V antigen gene, strain WA. 


1237 


0.0 


emb|X96797|YENeTVANT 


Y, enterocolitica V antigen gene, strain 
NCTC 


1221 


0.0 
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Sequences producing significant alignments . 






Database/Accession No, 


Gene 


Score 
(bits) 


E 

Value 


gb|S^S727|S38727 


IcrGVH operon: lcrV=V-antigen [Yersinia , j 
pseudo,] ' , \ ' * ■ ! 


'365 


v9e-99 j 



Shuffled clones are selected by phage display and/or screened -by ELISA to 



identify those recombinant nucleic acids' that encode polypeptides that have multiple 
.epitopes, corresponding to the different. serotypes. The shuffled antigen genes are cloned into 
afilamentous phage genome for poly\'aleni phage display or a suitable phagemid.' vector for. 
5 monovalent phage display. A typical protocol for panning antigens by phage dispFay is as 
follows. • ' . 

- ' • Coat an appropriate :surface (e.g. , Nunc Maxisorp tube or multiwell plate) 

at 4°C overnight with the target antibody,, usually at a concentration of 1- 
10 )ig-'mi in-PBS or other suitable buffer 

10 ' • Rinse and Block with PBSM (PBS nonfat dr>^ milk) at 37X- for 1-2 

hr : .'. , 

• Pre-block priage if needed (PBSM, RT 1 hr) 

• Rinse tube and allow phage to bind (usually 1 hr @ 37''C) 

• Can var>' time, temp, buffer, add a competitive inhibitor, 'etc. 

15 • Wash extensively (15x) with PBST (PBS + 0.1% TWEEN20),, then PBS 

• Elute bound phage with low pH (e.g., 10 mM glycine), 100 ruM 
triethylamine, competitive Ugand, protease, etc. and then neutralize pH if " 
rieeded. 

' . • Infect E. cbli with eluted phage to transduce expression phagemid into 

20 new host. Titer and plate for colonies on drug plates 

• Pool colonies into media, grow cells and infect with helper phage to 
produce phage for next round 

Phage ELISA assays are a useful method to rapidly evaluate single clones 
25 ' after panning of libraries. Single colonies are picked in individual wells of a multiwell plate 
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containing 2YT media and grown as a master plate. A replicate plate is. infected with helper 
phage and grown so that phage from a single well will display a single antigen variant. A 
suitable protocol for phage ELISA assays is as follo\vs. . 

• Coat microtiter plate with 50 ul of 1 \ig^'ml target antibody 4 °C overnight 
5 • Rinse and block with PBSM for 2 hrs @ 37 ^^C 

• Rinse, add preblocked phage and allow to bind 1 hr @ 37 °C 

• Wash plates with PBST 3x, then PBS 3x with 2 min soaks 

• Add HRP(or AP)-conjugated anti-M13 antibodies for 1 hr @ 37 °C 

• Add substrate and measure absorbance 
10" • Identify positive clones for further evaluation 

ELISA assays can also be used to screen for individual antigens with multiple 
epitopes or increased expression levels. Single colonies are picked in individual wells of a 
multiwell plate containing appropriate media and grown as a master plate so that antigens 

1 5 produced from a single well are a single antigen variant. A replicate plate is grow u and 
induced for protein production, e.g., by addition of 0.5 mM IPTG for Lac repressor-based 
systems and grown for an appropriate time for the antigen to be produced. At this point a 
crude antigen preparation is made which depends on the antigen and where it is produced. 
Secreted proteins can be evaluated by assaying the cell supematants after centrifugation. 

20 Periplasmic proteins are often readily released from cells by simple extraction into hyper- or 
hypo-tonic buffers. Intracellularly produced proteins will require some form of cell lysis 
such as detergent treatment to release them, A suitable protocol for ELISA assays is as 
follows. 

• Coat microtiter plate with 50 [x\ of 1 )ag/ml target antibody 4 ^^C overnight 
25 • Rinse and block with PBSM for 2 hrs @ 37 °C 

• Rinse, add antigen prep and allow to bind 1 hr @ 37 °C 

• Wash plates with PBST 3x 
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o Add HlU^(or A?)-conjugated secondaiy antibody and incubate for 1 
37 °C 

o Add appropriate substrate and measure absorbance 

o Identify positive clones for further evaluation 
5 Antibodies specific for many of the various antigens are commercially ' 

available (e.g.. Toxin Technology, Inc, Sarasota, FL) or can be generated by immunizing 
suitable animals with purified antigens. Protein A or Protein G Sepharose (Pharmacia) can 
be used to purify immunoglobulins from the serum. Various affinity purification schemes 
. can be used to further purify family-specific antibodies if needed such as immobilization of 
10 specific antigens to NHS-, CNBr-, or epoxyractivated sepharose beads;. Other related 
antigens may be included soluble form to . prevent binding- and immobilization of cross- 
reactive antibodies. 

The multivalentVolypeptides that are identified by the initial screening 
' protocol are purified and subjected to in vivo screening. For example, the shuffled antigens 
15 selected by a combination of any or none of these methods are purified and used to. 

immunize animals, initially mice, which are then evaluated for improved immune responses: 
Typically 10 micrograms of protein is injected to a suitable location with or without 
appropnate adjuvant, e,g., Alhydrogel (EM Seargent Pulp and Chemical, Inc.) and the 
animals are boosted with an additional dose after 2-4 weeks. At this point serum samples is 
20 drawn and evaluated by ELISA assay for the presence of antibodies that cross-react against ■ 
multiple parental antigens. In this ELISA assay format the antigens are coated onto 
' multiwell plates, then serial dilutions pf each sera is allowed to bind. After washing unbound 
antibodies, a secondary HRP- or AP- conjugated antibqdy directed against the appropriate 
test antibody constant region, e.g., goat anti-mouse IgG Fc (Sigma) is bound. After another 
25 washing, the appropriate substrate is added, e.g., 0-phenylenediamine (Sigma). The 

absorbance of each well is read by a plate reader at the appropriate wavelength (e.g., 490 nm 
for OPD) and those producing high antibody titers to multiple antigens are selected for 
further evaluation. 

Additionally, the ability of antigens to generate neutralizing antibodies can be 
30 evaluated in an appropriate system. Antigen variants that elicit a broad cross-reactive 

response are evaluated further in a virulent challenge model with the appropriate pathogenic 
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organism. For example, the multivalent polypeptides are used to immunize mice, which are 
then challenged with live Yersinia bactena. Those multivalent polypeptides that protect 
against the challenge are identified and purified. 

B Evolution of-broad-spectrum-vaccines-a gaiDSt-bacterial-to 

This Example describes the use of DNA shufrling to obtain muitivaJent 
polypeptides that are effective in inducing an immune response against a broad spectrum of 
bacterial toxins. . 

L Staphylococcus 

The Group A Streptococci, which can cause diseases such a.s rood poisonins, 
toxic shock syndrome, and autoimmune disorders, are highly toxic by inhalarion. The famiiv 
of Group A Strepiococcus loxms numbers about 30 related members, maxmr^ (his group a 
suitable target for family shuffling. Accordingly, this Example describes the use of family 
DNA shuffling to create chimeric proteins that are capable of eliciting broad spectrum 
protection. 

Nucleic acids that encode many diverse attenuated toxins are subjected to 
DNA shuffling as described herein. Table 2 shows the output of a BLAST search of 
GenBank, PDL, EMBL, and Sw'issprot using the S. aureus enterotoxin B protein to identify 
homologous genes that m.ay be used in a family shuffling format to obtain improved 
antigens. 



Table 2 



Sequences producing significant alignments j 

1: 
j 


- ■ . '\ 
\ 


Database/ Accession No. 


Gene [ Score 

i(bits) 


E . j 
Value \ 


splP01552|KTXB 5TAAU 


cNTriROlUXilJ TYPE H PRECUkiaOK li^KH) >d 


£>54 


e-157 ] 


pdb|iyE3|. 


bcapnyiococcai iinterotoxin H Complex^ 
Tri ... 


S04 




paD|ih;iiB|U 


bcapnyiococcus aureus >gi | Ib33i48 | oai 
Staphyl ... 


40e> 


■"e-115 


sp 1 p-iJilJ 1 KTCJ STAAU 


ri:;NTKKUTOXiW TVPl:: C-3 PRECTO^Ok 


375 


e-103 


sp|P0155i lETCi yTAAU 


[ KNTKHUTOXiW TYPE C-1 PREOJUyOk (SECT) 


358 


e-iOl 


sp|P34 071|l::TC:2 STAAU 


HNTEROTOXiN TYPE C-2 PRKCUk^UR CSEC71 


351 


"^6-99 


gi|29514b 


tiji3376j ent:erocoxin L^inapnyiococcus 


338. 


^e-92 


gi|295151 


lLiii7y ) enterotoxin Lb'tapnyiococcus 




ie-90 


gi|29Eil4i 


enterotoxin ibtapnyiococcus 


330 


4e-yo 


gi 129514,9 


lLi337»j enterotoxin Li:itapnyiococcus | 329 


ie-89 


pdb|lJCK|B 


unain B, i-L'eii Receptor Beta L'ham 0 328 
With S.'. . 1 


Je-89 


g 1-1-2-9 SI 41 


entrerot^ox-i-n— i-^taphy-l-ococcus-i-3-2B j-ie-^^B'^^^ 
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Sequences producing significant alignments 




■ 


Database/Accession No. 


Gene 


Score 
(bits) 


E 

Va!ue 


j pat>| ibL'^ 1 


stapnyiococcai ' bncerocoxm , Monocj ^27 
Ente - . 1 


4e-89 


gi| 190605^ i [u9lb2c} type c enterotoxxn ibtapnyi 

•\ 1 incermed. . . ' - , ' , ' 


: 326 


Be-89 


gi 1295147 . i (L1JJ77J enterocoxm tb'capnyiococcus 


323 1 7e-88 


bbs| 155101 


enterotoxin=pyrogenic toxin istapnylc 
4446,. P. . . 


319^ 


le-8^6^ ' 


gi|476764 ' j [L23i36bJ superancigen ^streptococcus 


311 


3e-y4 


gi|124Si72 


tU4b7y^) superancigen bSA iscrepcococ 
• pyogenes] > . . . 


310 


4e-94 


gi|1245174 


(U4e7a3j superantigen SSA istreptococ 
pyogenes] 


309 


16:83 


sp|P0H0y5|SPEA STRPY 


EXOTOXIN TYPE A PRECURSOR (SCARLET FI 


225 


2e-5S 


gi|4V^bb 


(X51560; type A exotoxin [streptococc 
pyogenes] >gi | . . . 


211 


3e-54 


pir 1 |i;i8753 


exotoxin type A precursor (allele 3; 
Streptococcu ... 


'21^ 


4e-54 1 


pirljS187B6 1 exotoxin, type a precursor (aiieie 2) ! 20^ \ 2e~b3 \ 

1 Streptococcu . . . I ■ \ { 


\ pir \' \'S1B7Q9' ; exotoxin A precursor iaiXeie 4; - 5t:j 206 f Se- .-:;3 i 

. , j pyo ... ■ I 1 ; 


gil4732B ■ 


iA5lbb4j type exDCcxm. t d t rept ococc l^^c ; ^e-bO 
pyogenes) |^ 1' . ' 


pir ! iA'26ib2 ■ 


streptococcal pyrcgenic exocoxm typ^ 185 l 2e-'46 
_ precursor - . . . j j 




sp 1 P131'gTl STXA_5TAAU- f KMTtlkOTOXlN TVPt: A PRECURSOR (SEA) >^ 129 ! 2e-29 •! 


prt 1 1 i7042UiA i encerocoxm a Lbcapnyiococcus aureus^ i2ti 


3e-29 ' 


pdb|lIiyFiA j i^taphyiococcus aureus >gi i 15 J J^J J | paJ I2b 

. 1 Staphyl ... ■ j 


2e-28 


.piri !A29b66 i enterotoxm A - stapnyiococcus aureus i^b t ^e-2ti 


sp|P12yy3 lETX!:: j^taau i 


i^NTKRUTOXlN TYPa E PRECURSOR . t SEH ) >c 


118 i 3e-26 


gii5106y2 j 


(U11702) enterotoxm h LStapnylococci 
>gii 10. . . 


' 


7 e - 2 0 


gi 1149047 


(M94a72) enterotoxm D IPlasmid pIB4^ 


39 


2e-17 


gi|25a9553 


(U93688i enterotoxm iStapnyiococcus 


76, 


2e-13 


gi- 1537ijb 


1M97156) pyrogenic exotoxin u istrept 
pyogenes ... 


57 


Be-OB 


sp|PI3380 |SPEC ^TRPV 


EXOTOXIN TYPE C PRECURSOR iSPE C) 


57 ■, be- 08 


gi 15297^4 - 


(U02555; Spec istreptococcus pyogenes 


55 |2e-07 


pir||A3050 9 


exotoxin c precursor - streptococcus 

.>gi|i-.. , , 


56 


Je-Q7 


gi|52y7b5 


tU02bb0) spec istreptococcus pyogenei 


5b ! 4e-07 


pirl|S27'^40 


enterotoxm b - staptiyiococcus 
aureus (fragments) 


53 1 le-Ofe 



Shuffled recombinant clones are initially selected by phage display and/'or 



screened by ELISA for the presence of multiple epitopes from the different families. Variant 
proteins with multiple epitopes are purified and used to for in vivo screening as described 
above. The mouse sera are analyzed for antibodies specific for different toxin subtypes and 
variants that elicit broadly cross-reactive responses will be evaluated further in challenge 
models. 
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2. Escherichia coli and Vibrio cholerae 

This Example descnbes ihe use of DNA shuffling to obtain cross-reactive 
muhivalent polypeptides that induce an immune response against the E. coil heat-labile toxin 
(LT), cholera toxin (CT), and verotoxin (VT). Nucleic acids that encode cholera and IT 
— toxin-B-chains-are-subjected-tG-DNA-shuffling. Table-3-shows-the-results-of a-BLAST search- 
^ using the V, cholerae toxin B-chain to identify homologous genes that can be used in a 
family shuffling format to obtain improved antigens. Homologous antigens have been cloned 
and sequenced from a number of related yet distmct Vibrio and E. coli strains, and additional 
natural diversity can be obtained by cloning antigen genes from other strains. These genes 
and others or fragments thereof can be cloned by methods such as PCR, shuffled and 
screened for improved antigens. 



Tables 



Sequences producing significant alignments j 

r 
f 




Database/Accession No. | Gene | Score 

1 1 (bits) 


E 1 

T illUC 


spiPUibbo iCH'iH_VI3CH , C'HOLKiiA ENTEROTOXIi: , 3STA GIAIN P.^^ECII 25^' 


5 e - 6 7 


gificayu j ixbBVb^; cholera toxin B protein 

\ cholera . . . 


248 ■ 


be-66 


gij/bdJbi 1 1X00171] ctx d iVibrio choieraej 


24b 


Je-6b 1 


prt t 1 lOuiiyfaA j toxin, cholera IVibrio cnoxeraej 


24S 


3e-55 1 


t gni t Piu 1 aiuu68b J | tujuob^; cnoiera toxin [Vibrio cnolej| 


244 


le-64 ! 


: pirllxwuB 1 cnoiera enterotoxm cnam B precursoi 

j cholerae 


"241 


le-bj 


gij^uybbb 


iM^jObUi cnoiera coxm suDunit d prec 
[Artificia. . . 


223 


/ e - 6 0 


bbs 1 li>BUOb 


cnoiera-iiKe enterotoxm B suDunit \\ 
cholerae, . . . 


211 


j.e-54 


spiPiJBlllKLBH ECOLI 


HKAT-LAHILH HWTKROTOXIN B .CHAIN PRKCq 


2 05 


be-b4 


pdbilXTC|D 


vicrio cnoierae >gi | 1S278 bl t pdb | IXTC 
choler ... 


2 07 




pdb|lFGB|U 


viDrio cnoierae >gi { 194 2 83 3 | pdb | it'CJB 
choler ... 


207 -- 


2e-5i 


pdb|:<!CHB|D ■■■ - 


unam u, cnoiera Toxm B-Fencamer cor 
With Gml. . . 


207 


:^e-bi 


pdbj iCHP|U 


viDrio cholerae >gi ( 1421bli | pdb j ICH? 
choler. . . 


■■2'05' 


le-b:^! ^ 


pdb| ICHQIU 


viDrio cnoierae >gi | 1421d-: 5 | pdb | ICHO 
choler. . . 


20S 


ie-b2 


, paDj ICTl |U 


cnain D, cnoiera Toxm B-Pentamer Mut 
Bound . . . 


204 ^ 


ie-b2 , 


sp| £332890 |i:;LBP ECOLI 


HEAT-LABILE ENTEKOTOXIN B CHAIN PUECl 


204. 


2e-32 I 


prt 1 1 UV0i264A 


toxin bTB cistron,neat laoiie Lnscnej 
coli] 


201 


""ye-b2 


pir 1 tULbCH ■ ■ 


neat-iaDiie enterotoxm cnain b precv 
Escheric ... 


201 


' 2e-51 


bbs| 1314yb 


tS6073lj neat-iaDiie enterotoxm B si 
B su. . . 


200 


■2e-bl 


prt 1 |770iyUA 


toxm LViDrio cnoierae j 


199 


be-51 




Escnericnia coii >gi j 4y4^bb | pab I ILTA 


179 


4e-45 




Escherichia c . . . 
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Sequences producing significant aligriments * 






Database/Accession No. 


Gene 


Score 
(bits) 


E 

Value 


pdh|lTET|p 


Vidrio cnoierae 


34 


0,21 . 



Those chimeric toxins that elicit high levels of neutralizing. antibodies against 



■ both toxins and have improved adjuvant properties are identified. For example, shuffled 
clones are selected by phage display and/or screened by ELISA assays for the presence of 
epitopes from the different parental B-chains. Variants with multiple epitopes are purified 
5 and further studied for their capacity to act as adjuvants and to elicit cross-protective • • , 
immune responses in challenge models. ' ^ 

Example 2 , 
Evolution of Broad-spectrum Vaccines against i?£?rr{?//aAwr^£^£?r/m 

Lyme disease is currently one of the rastest-growing infectious diseases in the 

10' . United States. It is caused .by infection of the spirochete bacterium Borrelia burgdorferi, • • ■ 

which is'carried and spread by the bite of infected ticks. Early signs of infection mcludeskin 

rash and flu-like symptoms. If left untreated Lyrrie disease can cause arthritis, hea'rt / 

abnormalities, and facial paralysis. Treatment of early Lyme disease with antibiotics can stop 

/ . the infection, but a lasting immunity may not deyelop making reinfection possible. A'eurrent - 

■ 15 ''Vaccine requires three immunizations over a 1 -year penod to acquire immunity. ■ 

■ ' * Both passive and active immunization with .the purified B. burgdorferi outer 

, surface protein A (OspA) protein- has been successful in proteciing against infection with'^; ; 

^ burgdorferi, h\ii has no effect against ongoing infections, since this antigen is not expressed 

in vertebrate hosts. OspA is normally anchored on the outside of the cell by a covalently 

20 ' attached lipid moiety through an amino terminal cysteine residue. In contrast, the outer 

surface protein C (OspC) is highly expressed by the spirochete in vertebrate hosts and 

vaccination of infected individiuals with OspC may be an effective therapeutic in curing the 

' inftciion (Zhong et al. {\991) Proc. Natl Acad. ScL 94 12533-12538. 

' ArecentBLAStsearch(Altschul, era/.,' (1997) Nucleic Acids Res. 25:3389- 

25 3402) of the non-redundant GenBank, PDB, SwissProt, Spupdate, and FIR databases was 

used to identify homologues of the OspA outer surface protein gene. This resulted in the 

identification of over 200 entries related to OspA. One, hundred entries are shown in Table 4 

below from different strains of B. burgdorferi, B. garinii, B, afzelii, B, tanukii, and B, turdi 
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that share at least 83% DNA sequence identity to the Borrelia burgdorferi OspA protein. 
The ospA genes from these and other strains provide a source of diversity for family 
shuffling to obtain improved antigens for the prevention of Lyme disease. These genes ari 
cloned by methods such as PCR, shuffled and screened for improved antigens. • 



Table 4 



j Sequences producing significant alignments 


I 




1 Database/Accession No. 


j Gene 




Score 


E 




i 




(DitS) 


Value 


4 




1 :iorreiia sp. gene tor outer surtace 

•| proce . . . 




1629 




ro 


irtu./xo J ^ o ! rt^uio> /a [ aorreiia sp . iO.MT gene ior outer ^ 

i surface ... . " - 




I 0 

1 

1 


. V 




1 Dorre^ia curai gene tor outer surtace" 
; pro ... 


j 




i U 


. L 


OO 3 ; Ab ulb / □ i A-b Jib s / b 


1 ::orreiia sp. gene tor cuter surtace " 

! prote ... 


) 


- 0 


f 
i 


i- 




f cspA=ou:er surtace prccem A IzjcrreTia 
: burgdor ... 


( 






I 


go 1 L J b 6 ^ j bOxr KA 


, ::crre^:a ourgaorteri \zione !JJi cspA ~ 




'948 


' u 






eruJD i XoUi o6 i enr' IKOUrb 


= . ourgoorteri x^i ro ospA gene 




■yjy 


U 






embUbbSyairjiiUiirAl " " 


f o.curgdorteri usp A gene ii'kOi 












; -orre-ia ourgaorteri L'-i-il outer 

I surface pr . . . . • , 


F 
1 


6'Jb ■ 




w 






! outer surtace protein A iHorreiia'""" 


i 


dby 


0": 


\J 




emr> i XooOb b t csiiuii^KOrt 


1 a. ourgaorteri uspA gene tor outer ' ~ 

1 surtace p. . . 




O J 3 j 


0. 






emo 1 xyb4-;U i H^l'iiiiJSrA 


! d.garinii ospA gene (i'.si substrain ; 






■J . 


"0" 




enibiXBb43y IHAPLJUaPA 


; tj.atzsiii ospA gene iPLi^ substrain; ' 

>gi i 5. , . ^ - _ 




dJ7 


o; 


0 




embiXUCiyj iHiJPHOOS?:r " 


b . ourgaorteri i^bo ospA gene : 




0. 


u 




embU^yOBViaHOSPAV 


ourgaorteri ivij4biJ GspA gene tor : 

ou t e r s u . , . 




a. 


0 




emb i X y b y y -J 1 ba::nads?a"" 


o.atzeiii ospA gene ' " ~ 




621 


c. 


0 




einb|X&21£i|HiiOi;iiAG ' "" 


ri . ourgaorteri piasmid ospA gene tor "7 — l 
outer su . . . * ' i 




u . 


"0 — 




gblU'/BliOliHHUVHJOl ' " 


borreiia arzelii major cuter rieraorane 
surf ac ... 


'621 




-0 -j 


emblXeSBSJy |ytlUi>VA2 


b. ourgaorteri dsp A gene""TPKCrj '■ 


aiy 






etTib|X7UJ&b IHBUPSAA 


b-ourgacrteri OspA gene — 


"Hiy 1 


■j: 


^1 


emb 1 XHb43 y i UAPLUU^PA 


d.aizeiii ospA gene IPLud substrairr; ' 


yij 


0 . 




emb I XH iU4 7 I ISBOFST; 


b . Ourgaorteri pxasmia uspA gene " 




y:j . 




U ; 


1 u^ujao 1 oAuzujDb j ijorreiia arzeiii bVi cuter surtace ] 

! protein A. . . ■ j 


0. 


13 ■ 






d.atzeiii ospA gene IPHo substrain) — ' 1 


vy; 


o: 


■0 




embjXbO:^bi liJHfWUULL 


B .ourgaorteri PWudil ospA gene '. ' 


vyi 


0. 


0 ' ■ 




einblZJyOHfe IB3USPAX' "■" 1 


o. ourgaorteri [^^^) uspA gene tor outer 
surf .... 


";yi-. 


c: 


0 




gblLiy/O^jHUKMAJOyp;^ 


borreiia ourgcorreri outer surtace 
protein. . . 


vyi 


0. 


0 




embiXb^JBV lUH^PA- 


b. ourgaorteri ospA gene tor outer 

surface prot ... 


yyy 


0. 


0 — 




erublXfctJJOUIUBASPA ' 


b. ourgaorteri gene tor UspA outer 

surface pro. . . 


■VBb 


D' 


0 




embjXb-ifc^4 IHBK4B0SPA ' 


b. ourgaorteri ospA gene " ^ 


7 b / 


cr 


u 




gb|MBbVt4 laUHOSPSHj; 


Horreiia ourgaorteri operon major outer 
mem. . . 


vby 


0 


u 




embiXtJ4i:^|HHPUyPA 


b. ourgaorteri piasmia ospA gene t"6r 
outer su. , . 




0 


u 




go'i-b J b o-J t-|-HUUUisVAb 


borrei-ra ourgaorreri outer surtace. 
protein A. . . 


■■74J 


D 


0 
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j Sequences produciQg significant alignments j. j 


Database/Accession No. 


j Gene 


Score 
(bits) 


E 

Value 




borreiia garinii lv4 outer surrace 
protein A. . . , 


74J. 


■0.0 




j b.ourgaorieri ospA gene (2.y / suDstram) 


714 


j o.u 


gb}La97qi jHOKOPisAii 


borreiia Durgaorreri rria^cr oucer 
surface pro, . . ' 


714 


j 0.0 . 

r ■ 




^borrexia Jourgaorreri ospA ana ospb 
genes for. . . ' 


70o 


U . 0 1 


. go 1 Ac;OUi; /yo 1 AL'COPTSO 


borreiia ourgaorreri piasmia ip34, 
' complet . . . ' , . , ' 




u . u 




i borreiia .ourgaorreri b"-.i - iU^ outer 
i surface pr. : . 


i 706 




emblX&ytOfc jHHKAUij'tJA ■ - 


1 b . Durgdorr era uspA gene 

1 >gi [1819262 jgb{ I264-. : , ' 


VOt) 


. 0 . p 




! .bprreiia garinii gene tor outer surtace 
j p. . ■ . 




j U . 0 

1^ , . ■ 


.go 1 Mb 72 4,0 j BOROy PA 


b.JDurgaorterei outer surrace protein A 
'(OspA)... ' . 


19U 


0.0 

1 


emblXaOlU.^J.KBPKAOPS'A 


1 b.Durgdorteri t^Ka ospA gene 






or-]-! ABUU/^ui j AiiOOVlCl . 


borreiia garinii gene tor outer surtace 

p. . , • , 


6 "^4 


0.0 ■ 

i 


emJD i ASb44 J j bsT^bUiDt'A 


b.ourgaorr en ospA gene ( r2b:> 
substrain) 




u . *J . ■ 1 


etuD j X. i 6 4 b 7 I o £jOS i^A 


Borreiia ourgaorreri vspA gene tor \ 
outer surf ... - , ■ - | 


.650 I 
i 


■ , i 




borreiia sp. i^Vb outer surtace protein i 
A pre .... ' ' • i 


6 5 0 j 


'J . 'J i' 




borreiia Durgaorteri zO piasTad ' j 
lipopr . . . ' ' ' ■ ! 


i>U2. j 


'J.J " 


emD i Xbb / J b ; ^dbU.NAUiiPA^ ' 


b . Durgaorreri ospA gene ■ 




0.0 . ■ 


emo |xab441 1 aGWAHUSi^A ^ 1 


b .garinii _ ospA gene vwabsou suDstram/ •[ 575 ; 


0.0^ 


gD|U95voy ( uyivoy ■ j 

■ ■ . * ! 


borreiia garinii outer surrace protein j- 
A (ospA, . . * * 


&74 


0.0. 


ODj i ABOOVUyg ] A3007Dg5 ! 

■ ■ ■ ' ' i 


borreiia garinii gene tor outer suriace 
P ■ * . ■ , 




0.0 




B .Durgdorreri t'Hei ospA gene 


56 U' ' 


0.0. 


gD|U4yiyo i b<Ju4yiyo . | 


borreiia garinii rr.aicr outer nemr>rane 
surf ac , . : ' ■ . • ' 




0.0 


emblXSSSOUIHHOyyAS | 


B .Durgaorreri usp a gene. {Hil) . 


6b2 j 


0.0 , 1 




borreiia ourgaorteri gene ror 
outersurface pr. .'. 


55'i 


0.0 


af)3 1 AbUibyvs j AH015575 


Borreiia vaiaisiana '^ene tor outer 
^surf ac .... 


,54 5 


0.. 0 


db3|AHOU;iUyiAiiOU71Gy . 


borreiia garinii gene ror outer surtace 
p.'... 


545 


0.0 


gt>tU9J7U7 |Uy3707 , j 


Borreiia garinii outer surrace, protein 
A {ospA. . . 


544 


0.0 


db3 [ABOO 1 AHOOVIU:^ 


Borreiia garinii gene ror outer surtace 
p.. . . * \ 


54:<i ' 


0 . D 


■db] 1 ABUUlU4i |:AHpoib41 . i 


Borreiia garinii dna ror outer surtace 
pr. : . 


536 


e-iyo, . 


. db] iAHUC7ii4 iABUC7114 - - 


-borreiia garinii gene • ror outer surtace^ 


53'i 


e-179 ^ 


dbj lAHOO/iUSJAUOOTIDS 


borreiia garinii gene ror outer surtace 
p. . . 


632 i 

i 


e-i/y 


gblUyj71UlUy3710 • , 


borreiia garinii outer surrace protein 
A (ospA. . . 


52B 


e-i/b 


■ dbj IABU071U5 l ABOUnuS--' 


Borreiia garinii gene tor. outer surtace 
p. . . 


o2h 


e - 1 / / 


dbj |AU00;iU4 IAB007IU-4 


BOrreiia garmii gene ror outer surrace 
p . . . 


524 


e-i77 


gb|Uyj70t |U5370r 


Borreiia garmii outer surtace protein 
A (OBpA. . . 


'h'20' 


e-176 


emb|xau:ibb I UUl^UJiOSFA 


B.garnii var ospA gene 


51j 


e-i7i . 


db] iAH00;XUU|AliUU7iUy 


Borreiia garmii gene ror outer surtace 
p. . . . 


60 7 


e-171 


gblLBll^iy mu;i05i;5AY 


Borreiia Durgaorreri (isolate 2-i4yy 
297) ou. . . 


fcOb 


e-171 
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Sequences producing significant alignments 



Database/Accession No. 

gD| uyj vil t uyj'/il 



Gene 



borreiia gari.-ii cucer suriace protein"" 
A (ospA. . . 

Borreiia garinii gene :or cuter suriace 
p. 



Score E 
(bits) I Value 



e-171 



gb|L^jij;|KUi^OSPAC- 
gb|i.Jjlj!j;HCKO!jtJAr 



goiL^^l'lJ liJUkOisi^A: 
go|LJJiib [HOi^OiiyAA 



iiorreiia burgaorzeri i isolate ^r-i49y \ hB 

Son 188 . 

Borreiia Durgaorteri (2 /'i?obL"i'Jj OspA 
gene, 3 . . . 



Borreiia Durgaorreri {4:^j;jNVj; OspA r 5 7 7 

gene, 3 . . . I 

Borreiia Durgdorteri iCAJ) OsoA gene; 
' 3 ' end . . . ^ 



gb|Uyj/'Ub|u93703 

gbJL:jJ14 0 jHOf^Ub ' PA.t- 

I db] |ABOO';ilO|AnOU;iiO 



! borreiia curgaorteri (Biy:.""!) OspA 

gene., 3 ' e . . . 

borreiia garinii oucer surrace crotein TTST" 

A (ospA. . 



37T 



e-i6z 



e- 160 



I Borreiia Durgacrieri 14iD5^M=ij OspA 
j gene, 3 • . . 



i borreiia garinii aene tor outer s-^rtace" 

I P- - . 

I borreiia Durador: er ; 
3 • end 



■v^'spM gene. 



oDj ]AbUu.x..:,MbuvVx^^- 1 Horrelia garxnii gene tor cuter surtace ; bSV e-iho — ' 

I p. . . ' i 
1 emDiAbu-iH4|i:;;;Tib03?A ! b.garnii TJd osoA gene ^ TTS- k-^" 


gD|u:yj/ub|Ui^j7ua f Horrelia garinil outer surtace crote- n TT^l ! I.T^s — ^1 

. { A {ospA. . . * i 1 ' " I 


tAbuu/iJ^jAJUUVlOV ; borrelia garinii gene tor outer surta^^ i 545 K--54 i 




borreiia ourgdorteri \2iSi'yfil} OspA 
gene, 3 ' . . . 


34b 1 e-ibi 

! ' 


003 1 abUU ;iil j Ai?0U71Il 


' borreiia garinii gene :cr outer surtace ! b4i • ' e-Ia^ 
p... . j { 


dbj |AHU071.j>|AaU07113 


borreiia garinii gene :cr outer surtace ! 317'": — : e-15- ' 

p... i ^ j. 


gbjL2J14 4,t!UHUi;PAK ' 


borreiia ourgaorieri \{,A6; 'OsvA gene, : i"e^TTo — 
3 ' end ... ' ! j- 


9b|U7yb4y|BAU75D2r5 


borreiia arzeiii ma}or cuLer membrane ; 
surf ac ... i 


■-e-147 


db3 tAHOOyyijJ jADOOgBoj ' 


borreiia garinii gene rcr cuter surtace ToT^ 

p. . . ; ' 


6-145 


embjxtao^jy |yauypAGZ — 


b.curgaorieri ospA gene tor outer 

surface p. . . 


4yd 


e-ljy 


db] |AH00S^bb^iAHU0yB52 

emb|XybJoU IHHUSPPFRA ' 


borreiia garinii gene ror outer surtace ""' 

p.-. - - ■ . 

b.Durgaorreri ospA gene (strain Pr'raJ " 


4bb 


e-iyy 


etnb|Xfc8b41|yiilJHKi 


b.ourgaorteri IPHKiJ plasmid OspA gene 
for ou, . . 


460 

445 . ■ 


e-i;r/ | 
e-i2j 1 


emb|Xba54U|UUywUUI 

emb|XybibH|HGUSPFLT 


b.DurgaorEeri ifWudl) plasmid OspA gene 
for ... 

b.garinii ospA gene (strain PLi) 


414 
■'414 


e ■• IT^ "\ 


db3 lAHuoydtojAiiouaaso ■■ 


borreiia garmii gene tor outer surtace""^ 
p. . . 


' jyj 


'•■e-114'- 

e - 1 0 '/ 


db] lABOoyybu 1 Aia'joyBbii 


borreiia garmii gene ror oucer surtace I JFd 

p... j 


oe-y5 


dbj |AiiO'uyBbijABUUyH61 ■ 


borreiia garinii gene ror outer surtace 2dj 
p. . . 


■■"^e-V4 



A BLAST search with the B, burgdorferi OspC protein gene revealed over 
200 related entries. Entries for one hundred sequences sharing at least 82% DNA sequence 
identity are shown in Table 5 below that provide a source of diversity for family shuffling to 
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obtain improved therapeutics in the treatment of Lyme disease! These genes are cloned by 
, methods such as PCR, shuffled and screened for improved antigens. ' 



Table 5 



Sequences producing significant alignments 


1 


I i 


Database/Accession No. 


Gene , 


Score 
(bits) 


1 E j 
1 Value 1 


1 


iJorreiia burgaorteri (JMi^ synthenase . 
(guaA) g . . . ■ 




j 0.0. 


1 

.i 




j Borreiia purgaorieri '1 s-rai-n ' z bGl b ; "' 
1 oucer s . . . 


t ii^-i 




^ gb!L'Uiyy4 IBBUUTSr?^ 


aorreiia curgaorreri sJl outer surtace 
proce ... . " , . 


t 


e-176 - 






tiorreiia ourgaorteri gene ior ourer 
surface ... 




e-17b 






Borreiia ourgaorreri plasmd cp2b. 
complec ... 


^'V22 


e-i/fa [ 


emb|X&9byD |,£3l333 lUS'PC 


j: B . Durgdorr en ospc gene' ror outer - 

( surface .... ' ' ' . 


1 ts ib 


1 e'-i74 

i 




go 1 Ar 'j^ydbU 1 AFUjyat)':; 


'Borreiia ourgaorreri cci .oucer surface 
■ pro. . . ' , 


[ b:33 

■ 

t 




go ; u^i /'yy i.^auyi ;yy 


: rjorreiia curgciorr en strai.". L5 ou-er 
surface ... 


3iy,' 


■ 1 


go 1 L-i^od / ..oUKJu^uisrC 


Borre-.^ curgaorren sstram IcJ; oucer 
sur. . . 




e-i4b 


gb, Ly:i3. fi^UUOoiJCY 


Borreiia ourgaorr en ; , sucstrair. ser.su . 
stric:..'. ' 


boy 


■e-i-;'^. \ 
1 


gb|uyi7t?2|nBuyi7y2 ' -"i 


Borrena ourgaorteri scram Hi^ outer t 
surf ac. . . ■ . 


4 7^ ; 

i 


e-iiJ . 




gblUyi7y7i33Uyr7?T" J 

gbiuyiHoi jyuuyi-801 " ■ ■ "'^ 


Borreiia ourgaorren szrain IPJ outer i 
surf ac ... 1 


i 


e-i^b . 






borreiia ■:ourgaorteri strain Plr cuter i 
surf ac ... . : ■ 




e-i:J3 , 




■db>iAHCUlJ77JABU01-i77 " ' 


borre.Lia ^aponica strain KUb / UNA ior 
Que. . . ■ . 


4 JO 


e-lld 




dbj 1 AaDCiJ7y !AB00137K 

1 


borreiia japonica strain OvKX"; ol^A tor 
Ou . . . ' 


4J0 


e-liy 




emb|Xy47yj iHyuyPCTXW ! 


b .-Durgcorreri ospu' gene (strain 'i'XoW; | 


418 i 


e-ilb 




db3 )AilUOUJbbiAH000335 ' 


Borreiia tanuKii una ror Oucer surtace "" 
pr: . . 


.4iy j 

i 


e-iib 

I 




goi u^^i.Ty^r ;HiJuyi7yy 


borreiia Durgaorreri scram IPi oucer "~ 
surf ac ... 


4iy j 

! 


■■'e-ilb 




db] lAHOUlJ ;i> |AHU0i:j7o 


borreiia japonica scram Kiilo UNA tcr""~ 
Ou. . . 


414 '\ 

1 


e-i i4 




emb|XVJ6J4 tHBOaPCC 


b.Durgcorreri (UiC^b) OspC gene , "■■ 


404 ' 


e-iii. 




emb|X621fc^|HUPCG I 


b .ourgaorieri gene, tor pL' protein ' ■"" 


jyy ; 


"e-loy 




emb ! Xfc ybyo 1 HBWUDOSrr— J 


b.Durgaorteri OspCgene, J' end 


jyy j 


e-ioy 




ertib|Xm52i|HAOyPCl \ 


b.atzeiii iscram ^Bo] cspC gene ' 


377 ; 






ui^j )rtouuuj^:s 1 Aauuuj4b | borre^ia atzelii UNA tor Uuter sur^-ac^ ' 

i p^- • • . . 


itbr \ 


(>e-y5 




1 


Borreiia arzeiri gene tor -outer surtace 1 
■p... - 1 




ye-yy 




dbj 1 AHUoyyyy lAiiuuyayy 


borreiia arzeiii gene ror outer surtace" 
p. . . 


ibv 


ie-yt 




emblXUlS^J iHAUyyc^i ; 
gb| AVUl^yB ;i| At'0:J9S71 


b.atzeiii 1 strain PL^V! ospC gene ' 


J J b 


3e-91 






borreiia Durgaorteri OL'li outer surtace 
pr. . . 


33V 


ie-yo . 




db] tAuouyyy7 |ABDuy8y7 


borreiia atzeiii gene ior outer surface"" 
p . . . 


337 


ie-yo 




db] |AHU00Jb4 |AHUU0Jb4 


borreiia canuKii UhJA tor Outer surlace 
pr... 


337 


ie-yo 




gb|Lib41J|iJ0kUiiPC 


borreiia Durgaorteri membrane protein ' 
(ospC) ... 


33b 


be-yo 




GO] IU4 9bUJ jBOilUSPC't' 


borreiia atzeiii gene tor outer surtace • 
pre . . . 


33b 


be-yo 




emblXyjSbb IBUUNAUCJFC 


b .Durgoorreri (ii. paciticus strain; 1 

ospc gene 


33j 


Je-yy j 



SUBSTITUTE SHEET (RULE 26) 



wo 99/41383 



83 



PCT/US99/02944 



Sequences producing significant alignments 




i 


Database/Accession No. 


Gene 


Score 
(bits) 


E 

Value 




aorreiia iDuraaorreri iscraan Orthl 
' outer su . . . ' 




be-.yy 


™ai:)^:j-hU4-ybu-j-|-bUKUbir^^vj^ {--b orred-i-a-a« e^i->i- gen e--r-er—Qui:- e-r-su-r race—- 

1 pro, . . 


-s-s-^ 


de--dy 




Borreiia aizeiii gene ror outer surtace 
p. . . . ' 




■ Je-BB 




Borreiia Durgaorteri vstraxn £6ij outer 
sur ... 


Je-BB 

1 


go 1 L4-idy^ j bUK-ibUbFL i rtorreiia Durgaorteri isrram .acai) 

1 outer su . , . 




Je-BB 


OD] j u^ybUi 1 bUKUSPC^ i fiorreiia atzem gene tor outer surtace 

1 pro. . . 




le-B7 


gOI } BBUD42 4G 


borreiia Durgaorieri g:-!? syncnetase 
(gucLA) a . . . 


J17 


ie-B4 


gb; lJ042yU iHHUC42c!C 


borreiia Durgaorieri ciMP syntnetase 
(guaA) g . . . 


'ilV 


le-B4' 


( 


Horreiia atzexii gene :cr outer surrace 
p. . , 


305 


je-Bl! 1 


ao} iAiSJ J:?dy j = ! bor re 1 1 a a 1 2 9 ill gene ror outer surtare ; JC:^ "e-H" 



go i UC 



t ^ z j. I ^ 4:: z . 



dJ3 j j AH V u y d y b j Ari G 0 2 ^ i 

emc I X d i D ^' I bbUi r Cl 



horreiia ourgdorteri — s:rain JSH; o-.iter 
3ur. . . 

ocrre^ia curgdcrten oucer surtace 

proz,^ ■ 

horreiia atze^ii gene" outer sufiface" 



-ITT 



i'bur 



cib3 !AS000i4!:^iAriv00j^^ 
emblX7J62b!BBOS?.rrr 



gdorteri istram " Bre; o 



:r Cuter 



003 |Ai30U0j-lfa !Ai>ju034c 



go j ij4^dyb !aOk2BUHFC 



Hcrreiia atzelii DMA 
pr 

li.burgdorteri iDKV) OspC gene 



P3 



ene~ 



lace" 



a garinii gene :or outer surtace 
or Outer surtace 



'Sorre 
Dro. . . 

Hcrreiia atzeiii DUA 



5rleri 



emb;XB15:;4 IBt^OSrC - 
db] iUyB2y6 jUBB^JJJc 



dbj |AH00yy02 j ABOui^yOz 
dbj |U4ybUb IBOROiiPCI 



gb|UUiBy:j|HHU01B32 



Bcrreiia curgdoi 

pr . . .. 

Hcrreiia ourgdorteri 
outer s . . . 



outer surtace 
.scram :^HJb4) 



TFT 



B.burador teri (shram r^bb) ospC aene 
Horreiia atzeiii 2oko circular Dxasmid 
DNA fc. . . 



a.garinii (strain WABb ' cu; osp(J gene 
li I 



>gi 1-8720 . 
B.garmii ospL' gene (strain DK32) 



Horrexia atzei.ii g ene ror outer surtace 
p. 



Borreiia garmii g ene :or outer surtace 
pro. 



db3 jU45j7y |yUiiUb'PCV-^ 



go I Af U2ybD4 j Ar'U:jybD4 
db] j ABOOOJbU j AliOUOJbv 



Horreiia burgdort eri J-jI outer surtace 
prot ... 

H.atzeli i (PLud ■strain) cspC gene 



1^ 



Borrelia garmii (strain HT54) ospC" ' 
gene f . 



gb| UUB2B4 }i3HU0a'^B4 

db3 I Ai3DOUJ4J |AH000J4:; 



db] [ AHOnOJbi l AHUOuj ^ 
emblXy47 ;y IBHU-^PLT^U: 



emblXy^Vta IBHUi^PUUi:; 



Borrelia atzelii .[ strain Vb ' 4fclj ospC 

gene ... 

Borrelia ourgaorieri CL ' b outer surtace 
pro. 



Borrelia atzeiii DK ' A tor Cuter surtace 
pr. 



Borrelia burgdorteri 1^^ ' / outer surtace 
prote ... 

Borrelia atzeiii DMA tor Outer surtace 
pr. 



,Bor r ej i a t anuk i i_L!;: A_t.o.r^au t.e r„surt a c e. 
pr- > • 

B. burgdorteri osoC cene (strain M\JL\ 

¥n '- — ~~ 



i B.atzelii ospU gene'istrain UKISJ 



T7F" 



1^ 



7W 



.Z5.b_ 



be-5I- 
le-75 



le-75' 



le-7B 



Te^ 



2e-77 



je-7D 



je-75 
je-76 



je-7b 



4e-7b 



4e-75 
4e-/b 

Te^r- 



7e-74 



le-7" ■ 



/e-71 



4e-cy 
4e-D9 



4e-&c 



4e-cc- 
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Sequences producing significant alignments 



Database/Accession No. 



borr.eiia aarinii 
DNA fo. . / 



Gene 

Circular uiasmia 



! Score 
I (bits) 



j borrei-a garinii gene tor outer suriace 

\ pro, . . 

I b. gar in 11 iNJ4 strain; ospL ge ne 

j borreiia curgaorreri uc"/ outer surtace — 



I pro. . . 

borreiia garinii circular oiasmic 

DNA fo. . . 



E 

Value 



2e-f)b 



TSTT 



db] |AHuoyyyi)ABUuytiya 



Ob] lAiiooydyy lABUoyyya 
gDiAfusyyoi !Ar'u;?yH6l — 



bcrreiia Durgaorreri (shram Hs) outer 
surf . . . • 

borreiia Durgaorteri OL"J outer surrace 



2h6 



pro . 



borreiia aczeiii gene ror outer surtace' 

P- • > 

borreiia atzeiii gene tor outer surtace 



4e-S3 



4e-63 



borreiia Durgaorreri OL'!^ outer surtace 
pro . . . 

b . Durgcorreri OspiJ gene, T^^eTT^ ' — 



borreiia garir 
gene : . . . 



(scram n^TT; osoc 



t "le-oi 



borreiia garmii gene ;cr ou:;er sur:ac = 



) Example 3 

Evolution of Broad-spectrum Vaccines zgzinst Mycobacterium 

Tuberculosis is an ancient bacterial disease caused by Mycobacterium 
tuberculosis that continues to be an important public health problem worldwide and calls are 
being made for an improved effort in eradication (Morb. Morta l Wkly Rep ( 1998 A ug 2 1 ; 



4e-63 , 



! ; pro. . . i j 


goi ^*4^cj?o i■DUK^yUbf'C 


! borreiia 
1 outer s , 


ourgdorteri zrrair: ^/b/-' 


i ^ ^ ^ 


; oe 

\ 

1 


■ o ^ 


l 013 3 i iJ4 y j 1 ^ORUyyL'':''L' 


; borreiia 
1 gene . . . 


ga r i n 1 1 s c r a i i LXb .-^ ; o s pL' 


; 24:' 


\ ^e- 

! 


°' i 

j 


gD| L4.:d /j 1 BORyUii'PC 


i borre.Lia 
i oucer su 


ourgaorreri isrram ^-l.^lWi; 


; 235 


:^e- 




dbxlD^ysyiibOuoypcj ■; 


sorreiia 
gene f . . 


garmii iscram HTJ'/j osp»J 


r 2^6 


5e- 


o'l 


dbj |D4y4ya!i3UUOKPCB 


dorreiia 
pro . . . 


garinii gene ror cuter surtace 


■ 


Se- 


O J. 


errujlXSybyLit UBT25Ui5PC 


b . ourgdc: 


-ten ospu gene, J' end 




fee- 




emi))X6yby4 i HaPHKOKPC 


b . ourgdor 


■reri uspu gene, end 




i^e- 


bd 


cio] !U4ybUb|BUU0Si'CJ 


borrei la 
pro. . . 


garmii gene :cr oucer surtace 


■ 220 ■ ■ 


. 2e'" 


b5 


ernoix&ysyb [yHPHiuyyc 


H . r>urgcoi 
surface . 


'ten ospu gene ror oucer 




*ie- 


bb 


etnblXyJbb4 i HGOSPPTKO 


b.garinii 


(prroD scram; opsC gene 




7e- 




emDiX-nS^blHHOSi'CI:: 


b.ourgaorteri lUKb) UspC gene 


220 


2e- 


bb 


goiL4'JB7U|BUR6USPC 


borreiia 
oucer sui 


ourgaorreri t scram vSuAJ 


22G 


-ie- 


03 


gb| L4-:iay4 I'HUU^VOBPC - - 


* borrei la 
outer 3 . . 


ourgaorter'i Istrairr 


21c 


ye- 


bb 


db3 |U4y5U4|BURUSPCH' ' 


borreiia 
pro. . . 


garinii gene ror outer surtace 


20H ■- 


5e- 


b2 


gb|L42y&y|HUU40SPr- 


borreiia 
surf . . . 


Durgaorreri is tram Zl^'l] outer 


"■'2Cii 

■ 


3e- 


bl 


db3 lAHOCUJby |AH00035B 


borreiia 
p. . . 


^aponica UUA tor Uucer surtace 


i- 


le" 


bO 


db] lAHUOaJbliABOODTSI 


borreiia 
p. . . 


^aponica LIMA tor Outer surtace 


1 


le- 


bO 
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47(RR-13); 1-6). It infects over 50 million people and over 3 million people will die from 
tuberculosis this year. The currently, available vaccine, Bacille Calmette-Guerin (BCG) is 
found to.be less effective in developing countnes andean increasing number of multidrug- 
resistant (MDR) strains are being isolated. . 
5 ■ The major immunodominant antigen of M. tuberculosis is the 30-35 IcDa 

(a.k.a. antigen 85, alpha-antigen) which is normally a lipoglycoprotein on the cell surface. 

Other protective antigens include a 65-kDa heat shock protein, and a 36-kDa proline-rich 

antigen (Tascon ei aL (1996) Nat, Med. l: 888-92). / 

Table 6 shows the output of a BLAST search using the 30-35 kDa major M 
0 Tuberculosis antigen (a.k.a. antigen 85, alpha-antigen) coding sequence to identiify 
homologous genes that may be used in a family shuffling lom^^iX to obtain improved 
antigens. Many homologous antigens have been cioned a,nd sequenced from a laree number 
of related yet distinct mycobacterial strains. These gene? are cloned by methods such as 
PGR, shuffled and screened for improved antigens. 

Table 6 



. Sequences producing significant alignmeats ! 




Database/Accession No. 


; Gene 

i' ■ • •■ 


Score 
(bits) 


E 

Value ^ 


gblUJyy^y iMTU3853i? ■ 


MycoDacterium cucercuiosis JO iU3a 
extracellu ... 


lyjb 


CO 1 




M . cuJDercuiosis istram Eraman) gene tor 
35-B a. . . ' 


1933 - 


0,0 . 1 




MycoDacterium ::uDercuiosis .4J7Uv 
complete ge . . . 


193 5 ;-" 


U . 0 t- 

i- 


emb|Xfc:2jy7|HB3S3 • 


K.novis tscram ii7jP2) gene tor tis-H 
antigen 


lyji 


0.0 "I 


gb|HJiUjy IMiiUSCGA i'""-' 


M.DOV13 bL'U gene encoding alpha - 
antigen, comp. . . 


1B59 


0.0 1 


emb|XS3By /(MKAAWTICJ, 

■ 


Mycotiacterium Kansasii -gene tor alpha 
antigen 


diy 


0.0 I 




MycoDacterium scroruraceum' UNA tor 
, alpha-antig . . *. 


706 


■o.c 


dbj | Ui6b4t iMyUAAC; . 


MycoDacterium intracexluiare gene tor 
alpha-a . . . \ ' ' 


531 


e-164 


emb|XtJ4J7 [MAALANT 


M. avium gene tor aipna-antigen 


bfey 


e-lbO 




MycoDacterium mcracei iulare UMA tor 
alph. . . 


bJJ 


6-149 j 




M. leprae genomic dna secruence, cosmid 
B38 ... 


iVl 


"e-iCO 


emb|XfeOyj4 1ML35B7^ 


M. leprae gene tor tD-B. antigen 


Jbi 




emb|Zilbbb|HLJr'BPAPH * 


M. leprae riDronectin-binamg protexn 
antige ... 


347 ■ 


^e-yj 


gbjMliVOlb IMIdUJI^KDA 


MycoDacterium tucercuiosis '12 kila 
antigen gene. 


J17> 


-;e-d4 


emb|XbJUJ4 iMHJUPG 


MycoDacterium dovis gene tor J^kJJa 
protein 


J17 


Je-a4 


db3 |UJe>4Hb jMSUJJKDAP 


MycoDacterium dovis genes tor JUkDa 
protei . . . 


jlV 


■-ie-B4 


emb|ALOJJU ;b jMTV0:!6 


MycoDacterium tuoerculosis HJ /Rv 
complete g . , . 


"■J IV 


Je-y4 
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Sequences producing significant alignments j 

■ . ■ i 




Database/Accession No. 


1 Gene 


1 Score 


E 

Value 


! 




Mycooacceriuni tuoerc-ilosis 
extracellular 32 . . . 


Ji7 


2e-tiA i 

. I- 




Myccbacterii- "< dovis gene tcr aipna ■ 
ancige ... 


3 0.5 


} be- 62 

1 




emb 1 V i C J 7 y 1 MGti bAAN'l' 


i M.goraonae gene encoamg bb-^ antagen 


1 JOJ 








1 iMycooacterium avium gene lor Mfibi, 
1 ancigen 8 . . . 


j 2bb 


^e-6o 


\ 


gbjMytJo^y IMS'GBSAA 


i MycoiDacceriurri leprae bb-A ancigen gene, 
1 compl ... 




6 e - 6 0 




dbj ii:43«4i {Mb'UAybCA 


MycoDacteriun leprae UNA ror antigen a:; 
com. . . 




;■ be- bo 




erTib|X92bfc7iKMAKI14 7 


.^.marmurn gene ror j2 KUa protein 
{partial ) 


'21'6 


ie- b4 




emDi ZJ JfcbU !MA3LiKP:6 


1 avium {/Vi^^ lyu/b; gene ror J^KDa 
protein ... 


1 ly^ 


7e-4V 




od;) !iJd/JJJ|Uy7J2J 


Mycobacterium avium gene ror antigen 
BSC and ... 


j 18 c 


be-4b 




f eniDi Zj J6b ; lyjVJ^Ki^lb 


.M. avium (ATCL" ij/6y; gene tor J^KUa 
protein. . . 


i 


:i:e-44 






. intraceiiUiare iATC'^' ij^du; gene ror 
32k. - . 




le - J>> 




eiTU) 1 x^2ooh I yj<£iAl'2'2 


M . asiac icur:i gene ror kDs. protein- 
■ {partial) 


i6o ; 

t 

1 


ie - J 5 




emb| VU77ibjyAj:^Ki^K01 


.''1 . as la -icum gene segment ct j^-,'Ci;a 
protein 


1 b c 


ie - J 




enu: i z D u / b J 1 2Kb 1 1 


M.aviur. comoiex cene rcr j ^ .-ciJa orotem i 
(pa... ■ . ■ 1 


1 6 c j 

1 


i e - J 5 1 

I 






M.aviu^> complex gene rcr .•cJJa crotem 1 
(?• ■ - ' ' i 


lb7 


f 




i 


M. avium complex gene ror KL/a protein 
i pa . . . 




jie-37 ! 

1 


i 


ema } zb u / / 4 i XAj ^ Ki* 6 6 


.M. avium complex gene ror KDa protein 
{pa . . . 


ISl 


Je-37 1 




emo 1 Z3 ^ b y : MAJ '2 Ki^ 1 7 


M. a VI urn iATL'C 1^3074? gene rcr J^icua 
protein ... 


161 


i e - 3 / 


emo 1 zjjbbi.; Mi Jl^ KPi:; i 

i 
1 


. mtraceiiuiare (atul: Jb/o^i) gene ror 
32JO. . . 


, ic i 


je- J 7 


. ' ' ' ■ ' 


emD|ZbO /bJ jHAJ^Kbby i 

1 


M. avium complex gene ror KUa prctem 
{pa . . . 


X O 1 


je-j/ 1 

j 


-,' '» , 


enaoi Zr>07 /-^ i MrtBlJKybl , 

! 


M. avium complex gene rcr- xua protein i 
{pa... '! 




4e-j6 i 

i- 




eniDi Zb076b iMA32Kb /'fc 1 

I 


M, avium complex gene ror j,^ Kua protein 
( pa . . . 


Ibj 


be-jD 




emo j Z5 0 ; / U jMA3:2Ky04 


M. avium complex gene tor S2 icDa protein 

(pa ' . - - . 


13J 


be- J:^ 


.... , . . - 


etnb|ZJJ6b y |MMJ:JKPilb 


M.maimoense gene ror S'2iWd. protein 
{partial ) 


IbJ 


be-3b 




emb| Zb07by jHAJ:^K5i4 - ■ 


M. avium complex gene ror Ki^a protein 

( pa ... y 


Ibj: 


6e-35 




emD| z^o IhA |MAJ:*!Xb7b 


M. avium complex gene ror ^2 .-cL-a protein 
Ipa ... 


14?,. 


le-33 






M. avium complex gene ror Kjija. protein 
(pa. . . 


14, 


^ e - 3 




emo 1 A b 7 l<i :2 y 1 MTB bL"G . 


Mycooacterium tuDercuiosis gene tor 
antigen 8 . . . 


14^ 


2e-3^ 




emn 1 Zb 0 ; b i 1 MAJ 2Ki) b4 


M. avium complex gene tor J2 KDa protein 
(pa ... 


14b 


'2e'y2 




emb|zy:<!7;u|M'irib ■■ 


MycODacterium tuberculosis hjvkv 
complete geno. . . 


14b 


2&-'S7 , 




ecub|Xy:2b IMyZULH 


M.szuigai gene ror J2. icDa protein 
(partial) 


141 


:<!e-31 




erab| ZbO ;bU |MAj:!KiU7b 


M. avium complex gene ror J2 KDa protein 
(p... 


127 


Ae''2'/ 




emb|Zb07bblHAJJKb77 


M. avium complex gene ror jwi Kua protein 
(pa. . . 


i:2J 


be-lib 




_eruD_|_z 3 J b b_4 I RH J :2 1 2 


M.Dovis (tiL\j) qene ror j^KUa protein 


iiy 


ye-"JS 






(parti . . . 




I 
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Sequences producing significant alignments 






Database/Accession No. 


' Ann 

t oene 


(bits) 


F 

Value 


i ■ 


>!.triviaie gene tor J2 KiJa protein 
( part ial ) 


luy 


9e-22 


: embiX52Syj \l'\Ch:L12jo 


K.ceiacum gene rcr >a^a pro tern- 
{ partial ) 


iOU 


Be-iy 




i M. leprae or obA .protein gene • 
j' >gij 287923 1 emb . . . 


' 50 


8e-15 




Mycobacterium, leprae cosmia '62b u:vA 
sequence 


' SB 


i je-ib 1 

i ' ( 




! X. leprae or b^c, protein gene ■ 


Ob 


; :3e-ib ^ 1 




compl . . . ' 


35 


3e-lb 


{ clluj 1 A ^ ^ 3 O ^ 1 rl^Ciiji. A J 3 

i-' ' . 


(partial) 






!" ' ■ 


(partial) 


i52 


2e-ij ! 

1 


" AnTPT? yq^Jg; m i 'MTTT77nn77" 


(partial ) ' 


B2 


2e-i3 1 

1 


! 


rl ■ 1. 1 d V c o c i ( o ^ciLC sc^iIiC.aU \J*, TS.x>a 

protein 


ti2 


^ e - 1 J 


erriD (Aj^d / j :rir i i j * * 


; partial) i 




Se-XJ . 


c rnJj 1 ^ J J o o J j rlA J ^ X\ r 1 X 1 


3 2... ' 


1 


■ 1 e - * ; 




M.peregrmum gene tor /a;-a protein j 
(partial) i 


74 i 


3e-i-l 1 


eruDj Zb0775 j MAHUiay^ 


M. avium complex gene :or S2 XiJa protein f 
(pa... _ " . i 




3 e - 0 j 


embiX52b7I !MXENO20: 

! 


M.xenopi gene tor J2 ,oa protein i 
(partial') . • '\ 




je-uy 




M.smegmatis gene segment c: Jz->u;a , j 
protein i 




le-yy ^ 


emb|2D0759iMA32KB22 > - j 


M. avium complex gene :,cr j2 KUa protein- j 
( pa . . . ' ! 


50 


.7e-y7 

1 


■ • i 


r^. avium comoiex gene ror j2 KUa, orotem j 
(pa. . . ■*" \ ' ■ 1 


50 


7e-a7 1 

I' 


embl Y077iy IMVi2KPKGb 


^t.vaccae gene segment oi j'2-xua protein j 


54 


4e-0b 1 


embiX925ya!MVAC:yi 

i 


M.vaccae gene tcr 32 xJJa protein 
(partial) 


b4 


4e--:Jb 


emb|X92b74 |MN0NC:4E> j M.ncncnrcmogenicum gene :or- KiJa 

\ protein ( . . . , . . 


4y , [ 


O.C03 


emt) j X9'^56y ! MSXMiyb 

■.! 


r^. simiae gene ror j-i icua protein 
(partial) . . 


4B 1 




eiTib|AJ0C2150 jMTAJ21SC- 


Mycodacterium tuDercuiosis riJ/Kv, MPT^i 
gene 


45 


u.g:i 


emb|ZVy700|MTCV10D7 


MycoDacterium tuDercuiosis hjvrv 
. complete g . ■ • 


44 


d/i7 


■gb|M5847:j jATUUAT j 


A. tumetaciens cniorampnenicoi 
acetyltransferas. . . 


42 




emblXy^bVB iMiiHKG90 


M.smegmatis gene ror j2 icua protein 
(partial ) 


40 


0 . bo i 


enJD 1 ^ b / ^ 1 MTtK^ b 0 


M.terrae gene tor 32 KUa protein 
(partial ) 


40. 


0.55 i 

\ ■ a: 


etnbjXy^bty !MiiL'KU149 


M . scroruiaceum' gene ror j2 Kua protein 
(par. . . 




0.65 


embiZ3366& |HtJj:^KPI14 


M.goraonae lAiuC 144 /□) gene ror j^KJja 
pro. , . 


4U 


0.55 


gb|M1770Ul fLUNPCA 


intiuenza Li/caiitornia/ /a nucieoprotem 
RNA ( . . . 


' 33 

t 





Example 4 

Evolution of Broad-spectrum Vaccines against Helicobacter pylori 
Chrome infection of the gastroduodenal mucosae by Helicobacter pylori 
bacteria is responsible for chronic active gastritis, peptic ulcers, and gastric cancers such as 
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adenocarcinoma and iow-grade B-cell lymphoma. An increasing^ occurrence of antibiotic- 
resistant strains is limiting this therapy. The use or" vaccines to both prevent and treat 
ongoing infections is being actively pursued (Crabtree JE (1 998) Gut 43; 7-8; Axon AT 
(1998) Gut 43 Suppl i : S70-3; Dubois et ai (1998) Infect. Immun. 66: 4340-6; Tytgat GN 
'S~^^{X9n)'li'liJnMtrPlm^ — 
Marchetti et aL (1998) Vaccine 16: 33-7; Kleanthous et aL (I998)5r. Med. Bull. 54: 229-41; 
Wermeille et aL ( 1 998) Pharm. World ScL 20: 1-17. 

Identification of appropriate Helicobacter antigens tor use in preventive and 
therapeutic vaccines can include rvvo-dimensional gel electrophoresis, sequence analysis, and 
10 serum profiling (McAtee et al. (1998) Clin, Diagn. Lab. ImmunoL 5:537-42; McAiee et aL 
(1998) Helicobacter 3: 163-9). Antigenic differences berween related Helicobacter species 
and strains can limit the use of vaccines for prevention and trearment of infections (Keenan 
etaL {\9m FEMS Microbiol Lett. 161: 21-7). 

In this Example, DNA I'amily shuffling of related ye[ immunoiogicaily 
15 distinct antigens allows tor the isolation of complex chimenc aniigcns that can provide a 
broad cross- reactive protection against many related strains and species of Helicobacter. 
Mouse models of persistent infection by mouse-adapted H. pylori strains that have been used 
to evaluate therapeutic use of vaccines against infection are used to evaluate shuffled 
antigens (Crabtree JE (1998) Gut 43: 7-S; Axon AT (199S) Gut 43 Suppl l:S70-3). 
20 The vacuolating cvlotoxin (VacA) and c\aotoxin associated gene products 

(CagA) have been evaluated as a vaccine against H. pylon infection in animal models which 
supports the application of this approach in humans. ' ■ 

Table 7 shows the results of a BLAST search using the H. pylori VacA gene 
to identify homologous genes that can be used in a family shuffling format to obtain 
25 improved antigens. Homologous antigens have been cloned and sequenced from a number of 
related yet distinct H. pylori strains and additional natural diversity can be obtained by 
cloning antigen genes from other strains. These genes and others or fragments, thereof are 
cloned by methods such as PCR, shuffled and screened for improved antigens. 
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Table 7 



j . \ ' Sequences producing significant alignments 

■1 ■ ' ■ ■ ■ "' ' ■ 






i Database/Accession No. { Gene 


Score 
(bits) 


Value 


go [.uyb:^ /I jrtP.u-?597i ! heiicooacter pylori i'srb,.^ v^izb) 
■ ' i inactive cy . . . ■ - 


i; 7874 


;0. 0 .^ 




heiicooacter- pylon section /o oz 
134 of . . , 


246B 


0.0 




, go | AFOUi-ibd |riPAF00135t3 

{,embiZ26ydJ.|HPCnTO>: - 


Helicooacter pylori vacuolating, 
cytotoxi ... 


. 2405 ■■■ 


U.CJ 






' H. pylori gene.rpr cytotoxin. ^ [2389 


• 0.0 




.;, gblU055V7i HPU0S6V7 


Helicobacter pylori e7-2U3 
vacuolating cytot . . . 


23B9 


■'0.0 . 


i 
f 

1 


gb|U0b6;6|HPU0Ei6Vfc 


Helicobacter pylori ooiyo j 23bb ■ 
cysteinyl - tRNA syn . . . . .j 


a.o, . 


gt> | U2y4 01 1 HPU2y4 01 


Helicobacter pylori ; vacuolat mg' 
cytotoxin ho. . . 


'2345 


0.0 




erar)(AJ0069by jriPYbyey Helicobacter pylon vacA gene, • !.2103 
■ , I strain Mz2 8 . . ' . : 


0^.0 . 


.gt> ; 5/^4:^4 1 ii/24y4 ■ i.i4U,Kc:a cytotoxm -HelicoDacter ^ f^OSO;.' 
| - , i pyiori, Genomi . ... j. 


0^. i; , 


gD|U0-/i45|HPUU7l4b. ■ | Helicobacter pylori .\CTC Ili^d j; 2050 .: 0.0 \ 

1 cystemyi tRN.-. , ' ■ ' . i ! ' . j ' " 




enUo iAuUU6ybdjHPy6563- 


rieiicoDacter pylon vacA gene, 
strai'n Mz2 6 . . . 


! 

. . E 


emi) 1 A-J0Uby//u j nPV6 970 ■ 


Helicobacter pyion" vacA gene, . 1,1^^2 j 0.0 
strain M229 ... , " ' ' ' . 1 . r ' 


gi3,|.AF0:/7y3y.j Ar077535 , _ 


neliccbacter; pyion strain lo-o- ,. ri834 
vacuolating. . . | 


0.0 


gp:j Af;uY /y4U j^Ar077940', Helicobacter pylori. 3 tram =33.- ■ {: lti,i4 

i' vacuolating-. . . ' ■ i ■ ' ' 


o-.o 


gb| AK077y41 |AK0V7y41 1: Heiiccoacter pyiori strain 549 , | 1778; 

* ■ , [vacuolating.:. * " '\ ■ 


0.0 


gd j Ab'o,/ ;y3y t Ai:'077y3B i 


Heii'Cooactex pyion strain' :bu" . . , 
vacuolating- .... >' ' 


1745 


o-.u. 


go:} Uo j;-^;oi5 1 iii^ub3><ibb i Heiicooacter pylpri. vacuolating. 
^ ■ ■ 1 cytotpxin ge-. . . ' ' 


a35 ' i 


0.0 i 




gd{ Ub3^ /U |HPU5j27G . 


Helicobacter- pylori vacuolating 
cytotoxin ge! . . 




0 .-0 . 1 




gb|U6J272|HPao3272 


Helicobacter pylori vacuolating, 
cytotoxin ge .... 


819 ■ p 


0.0 


gb|U6J2a:i IHPUS32S3 


Helicobacter pylori vacuolating 
cytotoxin ge . . . 


919 


U-0 


gb| U632B4 |KPU63284 


Helicobacter pyion vacuolating 
cytotoxin ge. . . • 


815 


0.0 . 


gb) U632b^ iKPUS3262 


Helicobacter pylon vacuolating 
cytotoxin ge . > . ' 


803 


0.0- ■■■■■ 


gb|U532by IHPU5326B 


Helicobacter pylon vacuolating 
cytotoxin ge.. . . . . 


803 


.0.0 - 


gb| U6327J |riPU^3273 


Helicobacter pylori vacuolating ■ 
.cytotoxin ge. . . '■' . ^ 


803 


0.0 ' ' 


gb|U63232|HPU63282 


Helicobacter pylori vacuolating 
cytotoxin ge. . . • 


.803 


'0.0 


gb|.Ui32by IHPU53259 


Helicobacter pylori vacuolating; 
cytotoxin ge . . . 


,795 


0.0 


gb| U6J2Vb iHPU5327€> 


Helicobacter pyion vacuolating 
cytotoxin ge . . . 


779 


u.o 


gb|Ub328/ IKPU532B7 


Helicobacter pylori vacuolating 
cytotoxin ge . . , 


77 y 


■u.o ■■- 


gb|U63253 IHPUS3253'"" 


•Helicobacter pyion vacuolating 
cytotoxin ge . . . 


771 


0 . 0 ■- 


gc i Uo3-iby 1 HPUb 3259 1 HeiicoDacter pylon vacuolating 

1 cytotoxin ge . . . 


771 


0.0 
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Sequences producing significant aiignments 1 i , 

.1 1 i 


Database/Accession No. 


Gene 


j Score 
(bits) 


i Value 1 


gb\\Jhr2Hh\HPVB3'2Bc 


Heiicooacter pyicri vacuolating 
cycotoxin ge . . , 


■ 771 


u . u 




HelicoDacter pyicri vacuolating 
cytotoxin ge ... 


76:3 ' 


■I 


0.0 


Sb\\J6r2 J\)\H9U63279 


HeiicoJDacter pylori vacuoiacmg 
cytotoxin ge . . . 




j 


0.0 

! 


gb|i;S32;; [HPU63277 


1 Heiicoi^acter pyiori vacuolating 
cytotoxin ge . . . 


7SS ■ ■ 




y . c i 


gb|U632B0|HPU63250 


Heiicot>acter pyiori vacuolating 
cytotoxin ge . . 




0.0 : 

i 


gb| U632bS |HPU6J:;6b 


Helicooacter pyiori vacuolating 
cytotoxin ge ... 


747 




0.0 i 


go| ubJ^t ; (HPU6J267 


Helicooacter pyiori vacuolating 
cytotoxin ge . 


' 747" " 




0.0 1 


gblUbJl^jyi IHPU63281 


Helicpoacter pyiori vacuolating 
cytotoxin ge, . . 


74^ - 




0.0 


gD| UoJioi |HPU532ol 


heiicooacter pyicri vacuolating 
cytotoxin ge . . . 


73 5 " 


! 

i 


J.O 


go|U6J274 }HPUdJ274 


rieiicooacter p/iori -.'acuoia t mg 
cytotoxin ge ... 


73 S ' 


; 0 . 0 


go I U6 3 ^ b b i H PU 5 3 'J 5 5 


HeiiccDacter pyicri , vacuolating "" "7 
cytotoxin ge ... " ^ 


737 


-0 '. 0 . 

" i 


emo 1 AJUUi34 3u iH?AJ34 3 0 • 

! 


neiicooacter pyiori vacA gene : 
(partial ) , . . . j 


737 

■ 


3.0 -i- 


emJDl AJOOy-lJb iHP;u;94j5 i 

i 


Heiicooacter pyiori vacA gene , ' f 
(partial ),.. . | 


730 


0.0 ...| 


emD| AJ00y4jy iHPAJ94 3 5 j 


neiicooactsr pyiori vacA gene " "J 
(partial) ^ . . . [ 


730 


u . a ■ 


gb|Ui>j:^7ifHPU63271 I 


heiicoDacter pylori vacuolating ! 
cytotoxin ge . . . " | 


724 i 

i 


I 


-.0 1 


emo| AjOUy4iB !HPAjy4iy 


Helicooacter pyiori vacA gene 
(partial ) , , , . 


722 


0 


. 0 - j. ' 


emo\fuj0i)9^22 ;HPAJ3422 | 


neiicooacter pyiori vacA gene ] 
(part ial ) , , . . ■ [ 


722 i 

i 


0 


• ■v ■■■\ 

t 

1 ... 


gb|U5323&|HPUD32S6 i 

i 


neiicooacter pyiori vacuolating 
cytotoxin ge. . . 


715 ! 

j 


0 


. i 


gb| Ue)J266 |HPU532o6 


rieiicooacter pyiori vacuolating j 
cytotoxin ge . . . i 


71c 




.0 1 

i ■ 


emnl AJo0y42U |HPAJy420 


Heiicooacter pyiori vacA gene "T 
(part ial ) , . . . '< 


714 


u 




- emb 1 AJ U 0 y 4 2 4 1 HPAJ 94 2 4 


HeiicoDacter pyiori vacA gene' ' 

(partial ) , ... 


714 


0 


.0 j ■ 


emb|AJU0y4ii |HPAjy4il 


heiiccoacter pyiori vacA gene """" 
( part ial ) , ... 


714 


c 


.0 ■ ■ i 

I 

1 


gb|U6J26UlHPUfc325G 


HeiicoDacter pyiori vacuolating 
cytotoxin ge . . . ■ 


70B j 

1 


L 




gb|Ui>i27yiHPU£,3275 , -[ 


Helicoi^acter pylori vacuolating 
cytotoxin ge . . . 


"7C8 j 


0.0- 


emoi AjOUy4iy IHPAu'y413 


HeiicoDacter pyiori vacA gene 
(partial) , . . . 


706 


0.0 j 


emb|AJO0y42B |HPAjy42B 


Helicobacter pyiori vacA gene 
(partial) , ... 


706 


0.0 j 


emb|AJ00y437 iKPAjy437 


HeiicoDacter pyion vacA gene 
(partial) , , . . 


706 


0.0 j 
1 


emn 1 AJuoy4:i7 jHPAJy4:=i7 


HeiicoDacter pyiori vacA.gene ■ 
(partial) , . . . ' 


■704 


0.0 




gbtUbi2b7|HPUS3257 


HeiicoDacter pyiori vacuolating 
cytotoxin ge . . . 


700 


0.0 




emb|AJ00y4^J IHPAJ9423 


HeiicoDacter pyiori vacA gene 
(partial ) , . ' 


598 


0.0 ■ 
i . . 





(1 
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. Sequences producing significant alignments 




i ■ i 


Database/Accession No. , 


Gene 


Score 
(bits) 


^ ^ 1 

Value 1 




rielicoDacter pyiori vacA gene . i by^ 
(partial ),■... \ 1 ■ . , 


0.0, i'^ 


emb i AJ0O^,4iV [Hi'AJi*4-i7 . 


Helicooacter pylon vac/^ gene ■ 
(partial) ... ■ ^ ■ ■ 


bbO 


.0.0: 


' emb; AJ0Gy42i.lHPAjy421 i HelicoDacter pyion vacA gene j buo 
■ ". .} (partial) , .'. . . ; 


0.0 


erriD|AJ00942D jHPAJ9426 


■ Helicooacter pylcri vacA gene 
. (partial )<...... . ' 


bdtJ ■ 


O.O' 


embiAJ00943S iHPAJ543B * 


Helicobacter pyiori vacA gene 
- (partial ),.-..■ 


588 


0.0 


gb! |HPU53264 


Helicobacter pyiori vacuolating 
cytotcxin ge . . . 


57 b 




embi AJ00 94 3 3 |HPAjy4 JJ 


Helicobacter pyipri vacA gene,, 
(partial ),,;..,„"'■ 


666 


0.0 


j-emb| AJ00 94.::^S | HPAJ94:^b 


HeiicoiDacter pylon vacA gene . 
(partial )./ .... ■ ' / 


55B 


0 .^0 , 


^ gb|U63z5B IHPUfa'J'JBB ' 1 Helicobacter ; pyipri vacuolating 
' ; cytocoxin ge . . . ^ \ 




0.0 , 


embi AJ0;Gy44:^ lHi:^AJ544:j ■ • : He-icobaccer. pyiori vacA, _gene \oZo ■ re-x/,/ |, 

■ ' (partial) , . . . ' ' ; j ■ M 


emb! AJ0C9444iHPAJ9444 Keiicobact.er pyiori-- vacA gene • \ . .i's--/-' | 
" \ ' } (partial) , . . . ! J | 


gb| UBOOSB jHPUBOOSB i Heiicoi:acter pylori strain iij, o22 |e-^/b, j 

1 vacuolating , . , , ' ! . ' ' j' ' ! 


emb| AJ00y4i4 iHyAJy434 , | HeiicoDac.ter; pyiori vac^ gene . i old | 
' 1 ' (partial) , .... \ L' • 


e 1 1 H 


embiAJ009441|HyAjy441 i Helicobacter pyiori vacA gene ibUo. 

_ . • ; (partial) ,..:■,,* 


e-170 ; 

■ i 


i gbi Ar0355i6 iHPVCPli j HeiicoDacter pyiori . 5 tram kj4^ i.byy 
• , i vacuolating ... ■ | . 


e-i6B . 


j emb| AJ009447 |HPAjy447 

• ; 


Helicooacter pylon vacA gene ; 
(partial ) , , . . - - \ . ! 


b b / ■ \ e - 1 b 

■ ■ . 'i 


embj AJ00 944 0 IHPAJ944 0 


Heiicpoacter pyiori vacA gene 
(partial ),...,' ■ 




e - lb b 


j embi AJ009436 |HPAJ943o 


HeiicoDacter pylon vacA gene \.^^.=> 
(partial) , , , . ^ ' . , I . 


e-l'bb { 

■ I 

!. 


embl AJ00944i.| HPAJy44i 


HeiicoDacter pyior> vacA .gene 
(partial) , ' . . . 




e-155 ' 


emb |AJ00944fe |HPAJ944b 


HeiicoDacter pylon vacA- gene 
(partial ),..,■ . " 


5bb • 


e - 1 5 b 


gb'[U80057 IHPUB0057 


Helicobacter pyiori strain 
vacuolating ... . 


5b3 


e-lbb 


gb| AFU4273ii |AKU427'3 5 


HeiicoDacter pyiori jk^^ 
vacuolating cytot... ' ' 


5b J ■ ^ 


' e-155 


emb|AJ00944b IKPAJ944S 


Helicobacter pyicri vacA gene^ b4 / ■ 
(partial) , . . . " I • 


e-153 ' 


gBlAF0356C9|AK0Jbt>0y 


Helicobacter, pylon strain RlUA 
vacuolatin. . . ' ■ ' . 


b4 / 


e-lD3 


gb|Ai-'04:2VJ4!AVU4:j734 


Helicobacter pyiori JKl vacuolating ' 
cytoto. . . 


biiV 


e-150 


gblAFOibti^ IAF0JB512 . 


HeiicoDacter .pylon strain R26A 
vacuolatin . . 


"347 


se-93 


gb|AKUi5613 |AF0i5613 


HeiicoDacter pyiori strain R40A 
vacuolatin. . . 


323 


le-B5 


■embf AJ0069£.7 IHPY6957 


HeiicoDacter pylon vacA gene, 
strain. Mzl9 ... 


317- 


8e-84 


-gb-!AF03561biHPVCPl 


Helicobacter pylon strain K34A 
vacuolating ... 


315 


3e-B3 


gblAF0Jbfel4 |AF0i5514 ' 

i 


Helicobacter pylon strain KbOa 
vacuolatin. . . 


307 . 


8e-Bl 
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Sequences producing significant alignments 



Database/Accession No. 


Gene 


; Score j E 
1 (bits) 1 VaJue 


j gbjUyiSVa IHPU9157S 


Helicobacter pylori strain J / vacA 
gene, pa . . . 


loy 4e-21 


! gD f uyib /iy i HP uyi57i^ ! HelicoDaccer pyiori strain r /y vacA 
J . .!, gene, pa ... . 


loy 


4e-21 


j go 1 uy 157b : HPuyi57o ! Heiicooacter pylori strain rb4 vacA 
j. j gene, pa . . . 


rio7 
1 


le-20 1 


! emjD 1 Yi474^ j riPVACA26 (Helicobacter pyiori partial vacA 101 ; 9e-19 
j I gene, stra ... - j j 


■ gD|uyi577 (npuyi57 / 1 Helicoijacter pylori strain i- 94 vac A 

1 gene, pa. . . 


100 3e-15 i 


gD[uyi576 {HPuyib /6 ; Helicobacter pyiori strain Ir'Vi vacA 

j gene , pa . . . 


100 


je-lS } 


gD| AF03 5610 (Ar 03 5 610 


Helicobacter pylon strain kua 
vacuolatin ... 


94 


2e-i6 


gb|UyibaO|HPUyi580 


Helicobacter pylori strain tau vacA 
gene, pa , . , 


92 I 5e-I6 

i 


gD|AfUJ5cii ;Ar 035611 j Helicobacter pyicri strain R55A i 92 

1 vacuolatin ... • 


0 e - 1 6 } 

I 


eroD j Y^';74 4 i HPVACA*; ^ i Helicobacter pyiori partial vacA eg | ie-14 ; 

! gene , stra . . '1 \ 



Table S shows the results of a BLAST search using the //. pylori CagA gene 
to identify homologous genes that can be used in a famiiy shuffling formal to obtain 
improved antigens. Homologous antigens have been cloned and sequenced from a number of 
related yet distinct H. pylori strains and additional natural diversity can be obtained by 
cloning antigen genes from other strains. These genes and others or fragments thereof are be 
cloned by methods such as PCR, shuffled and screened for improved antigens. 



Table 8 



Sequences producing significant alignments | 




Database/Accession No. 


__;Gene _ 1 

i 


Score 
(bits) 


E 

Value 


gb|AF083352|AF083352 


Helicobacter pylori cytotoxin associated p... j 7041 


0.0 


gb|AE000569|HPAE000569 


Helicobacter pylori section 47 of 134 of... ; 


5501 


0.0 


gb|L11714|HECMAJANT 


Helicobacter pylori major antigen gene 
sequ... 


4976 


0.0 


emb|X700391HPCAI 


H.pylori cai gene for cytotoxicity associated 


4294 


0.0 


gb|U60176|HPU60176 


Helicobacter pylori cag pathogenicity . 
island... 


4294 


0.0 


dbj|AB003397iAB003397 


Helicobacter pylori DNA for CagA, 
complet... 


4274 


0.0 


gb|U80066|HPU80066 


Helicobacter pylori strain 213, cytotoxin- 
as... 


349 


2e-93 
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Sequences producing significant aiignments 






Database/Accession No. 


Gene 


Score 
(bits) 


E 

Value 


; gb|U80065|HPU80065 | Helicobacter pyion strain i 84, cytotoxin- 
■ ■ • f -as... 


^ 343 


le-91 1 


gb|AF043488!Af 043488 | Helicobacter pyion JK252 cytotoxicity 
j ■ ^ j ass... 


178 


4e-42 


gb|AF043487iAF043487 


Helicobacter pylori JK25 cytotoxicity 
asso... 


170 


le-39 


gb|AF043489|AF043489 


Helicobacter pylori JK269 cytotoxicity 
ass... 


163 


2e-37 


emb|X7003 Sj'HPC AID UP ' 


H. pyion DNA duplication sequence within 
th... 


159 ! 4e-36 j 

i 1 


gb|AF043490|Af 043490 j Helicobacter pyion JK22 c\^otoxicity 

. . ■ ! asso,.. . 1 


153 1 2e-34 { 



Example 5 

Development Of Broad-Spectrum Vaccines Against Malaria 

This Example describes the use of DNA shuffling to generate improved 
vaccines against malana infection. .An excellent target for evolution by DNA shuffling is the 
Plasmodium falciparum merozoite surface protein, MS? 1 (Hui et ai (1996) Infect. Immun. 
54: 1502-1509). MSP 1 is expressed on the surface of merozoites as an integral membrane 
protein: It is cleaved by parasite proteases just before and concomitant ,with rupmre and 
release from infected cells. The cleavage appears to be obligatory for full function in MSP 1 
binding to RBC receptors. The cleaved fragments remam attached to the membrane of the 
merozoite. Other membrane proteins on merozoites also participate in the attachment and 
specific invasion events. MSPl is a proven candidate for inclusion in a vaccine against the 
asexual blood stage of malaria. 

. The genes encoding MSP 1 can be isolated from various isolates of 
Plasmodium falciparum merozoites by PGR technology. Related naturally existing genes 
can be additionally used to increase the diversity of the starting genes. A library of, shuffled 
MSPl genes is generated by DNA shuffling, and this library is screened for induction of 
efficient immune responses. 

The screening can be done by injecting individual variants into test animals, 
such as mice or monkeys. Either purified recombinant proteins, or DNA vaccines or viral 
vectors encoding the relevant genes are injected. Typically, a booster injection is given 2-3 
weeks after the first injection. Thereafter, the sera of the test animals are collected and these 
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sera are analyzed for the presence of antibodies that reduce invasion of merozoites into 
uninfected erythrocytes (RBC). RBC are infected by the merozoite, immediately inside the 
RBC, the merozoite differentiates into a ring and this matures to a schizont that contains 
several nascent daughter merozoites, which then burst out of the infected cell, destroying it, 
" T^' and go on to affacfi"and1h\^e~aiibtfTer is iikely only 

extracellular for seconds. In vitro, any blockade of this event can dramatically reduce ilie 
level of reinfection. Antibodies against MSP 1 bind to the surface of merozoites that are 
released from schizont infected RBC when they rupture and thereby reduce the ability of 
. these merozoites to attach and engage cognate RBC receptors on the uninfected RBC 
10 surface. Merozoite attachment is reduced, merozoite entry into new RBC is reduced,, and the 
numbers of newly invaded cells detected at the early ring stage is therefore reduced if the 
culture is examined several hours after the blockade of invasion test. In riome assay- formats 
a surrogate of merozoite invasion inhibition is to note the appearance of aggjuiinated 
merozoites, although this is an indirect measure of antibodies that cause reduced invasion. 
15 The shuffled antigens that induce the most patent antibody responses 

reducing invasion of merozoites into uninfected er>ihroc>1es are selected for funher testing 
and can be subjected to new rounds of shuffling and selection. In subsequent studies, the 
capacity of these antigens to induce antibodies in man is investigated, .\gain, either purified 
recombinant antigens, or DNA vaccines or viral vectors encoding the relevant genes are 
20 injected and the protective immune responses are analyzed. 

Example 6 

. - Development Of Broad-Spectrum Vaccines Against Viral Pathogens 

This Example describes the use of DNA shuffling to obtain vaccines that can 
induce an immune response against multiple isolates of viral pathogens. 

25 A. Venezuelan equine encephalitis virus (VEE) 

VEE belongs to the alphavirus genus, which are generally transmitted by 
mosquitoes. However, VEE is an unusual alphavirus in that it is also highly infectious by 
aerosol inhalation for both humans and rodents. The disease manifestations in humans range 
from subclinical or mild febrile disease to serious infection and inflammation of the central 
30 nervous system. Virus clearance coincides that of production of specific anti-VEE 
\ — antibodiesrwhich-are'believed40"be-the-pnmary-mediators-of-protecti-ve-in™ 
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(Schmaljohn et ai (19S2) Nature 297; 70). VEE is an unusual virus also because its primar>' 
. target outside the centra! nervous system is the IvTnphoid tissue, and therefore, replication 
defective variants may provide means to target vaccines or pharmaceuticaily useful proteins 
to the immune system. 

At least seven subtypes of VEE are.icnown'that can be identified genetically ' 
and serologically. Based on epidemiological data, the virus isolates fall into two main " ' 
categories; I-AB and I-C strains, which are associated with VEE epizootics/epidemics, and 
the remaining serotypes, which are associated primarily with enzootic vertebrate-mosquito 
cycles and circulate in specific ecological zones (Johnston and Peters, In Fields Virology-, 
Third Edition, eds. B.N. Fields et ai, Lippincott'-Raven Publishers, Philadelphia, 1996). 

The envelope protein (E) appears to be the major antigen in inducing 
neutralizing' Abs. Accordingly, DNA shuffling is used to obtain a library of recdmbinani E 
proteins by shuffling the corresponding genes denved from, various strains of VEE. These 
libraries and individuals chimeras/mutants thereof are subsequently screened for their 
capacity to induce >widely cross-reacting and protective Ab responses. 

B. Flaviviruses 

Japanese encephalitis virus (JE), Tichc-bome encephalitis, virus (TBE) and 
Dengue virus are anhropod-bome viruses belonging to the Flavivirus family, which 
compnses-69 related viruses. The heterogeneity of the viruses within the family is a major 
challenge for vaccine development. For example, there are four major serotypes of Dengue 
virus, and a tetravalent vaccine that induces neutralizing Abs against all four serotypes is 
necessary. Moreover, non-neutralizing antibodies induced by infection or vaccination by 
one Dengue virus may cause enhancement of the disease during a subsequent infection by 
another serotype. Therefore, cross-protective, broad spectrum vaccines for.TBE and JE 
would provide signitlcant improvements to the existing vaccines. In this Example, the 
ability of DNA shuffling to efficiently generate chimeric and mutated genes is used to 
generate cross-protective vaccines. 

L Japanese encephalitis virus 

Japanese encephalitis virus (JE) is a prototype of the JE antigenic complex, 
which comprises St. Louis encephalitis virus, Murray Valley encephalitis virus, Kunjin virus 
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and West Nile virus (Monath and Heinz, In Fields Virology, Third Edition, eds. B.N. Fields 
et al.y Lippincoi.-Raven Publishers, Philadelphia, pp 961-1034, 1996). Infections caused by 
JE are relatively rare, but the case-fatality is 5-40%'because no specific treatment is 
available. JE is widely distributed in China, Japan, Philippines, far-eastern Russia and India 
providing a significant threat to those traveiing~irrthese areas. Cuf]^t]y~available"JE vaccine 
is produced from brain tissues of mice infected with single virus isolate. Side effects are 
observed in 10% to 30% of the vaccinees. 

To obtain chimeric and/'or mutated antigens that provide a protectivcimmune 
response against all or most of the viruses within the JE complex, DNA shuffling is 
performed on viral envelope genes. The amino acid identity within the JE complex varies 
between 72% and 93%. In addition, significant antigenic variation has been observ-ed among 
JE strains by neutralization assays, agar gel diffusion, antibody absoiption and monocianal 
antibody analysis (Oda (1976) Kobe J. Med. Sci. 22: 123; Kobyashi a!. (1984) InfecL 
Immun. 44; 1 17). Moreover, the amino acid divergence of the env'eiope protein gene an;ong 
13 strains from different Asian countries is as much as 4.2%o (Ni and r^arreti (1995) J. Cen. 
ViroL 76: 401). The resulting library of recombinant polypeptides encoded by the shuffled 
genes is screened to identify those :hat provide a cross-protective immune response. 

2. Tick-borne encephalitis virus 

The tick-borne, encephalitis virus complex comprises 14 antigenically related 
20 viruses, eight of which cause human disease, including Powassan, Louping ill and Tick - 
borne encephalitis virus (TBE) (Monath and Heinz, In Fields Virolog:^, Third Edition, eds. 
B.N. Fields et ai, Lippincbtt-Raveh Publishers, Philadelphia; pp. 96"V-'r034, 1996). TBE has" 
been recognized in all Central and Eastern European countries, Scajncimavia and Rusria, 
whereas Powassan occurs in Russia, Canada and the United States. The symptoms vary 
25 from flu-like ilhiess to severe meningitis, meningoencephalitis and meningoencephalitis with 
a fatality rate of 1% to 2% (Gresikova and Calisher, In Monath ed., The arboviruses: 
ecology and epidemiology, vol, IV, Boca Raton, FL, CRC Press,;pp. 1 77-203, 1983). 

Family DNA shuffling is used to generate chimeric envelope proteins derived 
from the TBE complex to generate crossprotective antigens. The envelope proteins within 
30 the family are 77-96% homologous, and viruses can be distinguished by specific mAbs 
(Holzmaml-e^a/T^,A^accine,4-0,J4S,^1.992).J^he-envelope-proteinxf-Powass^ ^ 



10 
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identical at the amino acid level with that of TBE, and cross-protection is unlikely, although 
epidemiological data is limited. ' 

Langat virus is used as a model system to analyze protective immune 
responses ZAZ vivo (lacono-Connors a/. (1996) Virus Res. 43: 125). Langat virus belongs to 
the TBE complex, and can be used in challenge studies in.BSL3 facilities. Serological 
studies based on recombinant envelope proteins are performed to identify immunogen 
variants that induce high levels of antibodies against envelope proteins derived from most or 
all viruses of the TBE complex. • 

3. Dengue viruses 

Dengue viruses are transmitted though mosquito bites, posing a significant 
threat to troops and civilian populations panicularly in tropical areas. There are four major ^ 
serotypes of Dengue virus, namely Dengue 1, 2, 3 and 4. A tetravaient vaccine that induces 
neutralizing antibodies against . all four strains of Dengue is required to avoid antibody- 
mediated enhancement of the disease-when the individual encounters the virus of the other 
strain. 

The envelope protein of Dengue vipjs haij been shown to provide an immune 
response that protects from a future challenge with the same strain of virus: However, the 
levels of neutralizing antibodies produced are relatively low and protection from live virus 
challenge, is not always observed. For example, mice injected with genetic vaccines 
encoding envelope protein of Dengue-2 virus developed neutralizing antibodies when 
analyzed by in vitro neutralization assays, but the mice did not survive the challenge with 
live Dengue-2 virus (Kochel et ai (1997) Vaccine 15: 547-552). However, protective 
immune responses were observed in mice immunized with recombinant vaccinia virus 
expressing Dengue 4 virus structural proteins (Bray et ai (1989) X ViroL 63: 2853). These 
studies indicate that vaccinations with E proteins work, but significant improvements in the 
immunogenicity of the protective antigens are required. 

In this Example, DNA shuffling is performed on the genes encoding the 
envelope (E) protein from all four Dengue viruses and their antigenic variants. Family DNA 
shuffling is used to generate chimeric E protein variants that induce high titer neutralizing 
antibodies against all serotypes of Dengue. The E proteins of the different dengue viruses 
share 62% to 77% of their amino acids. Dengue 1 and Dengue 3 are most closely related 
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(77% homologous), followed by Dengue 2 (69%) and Dengue 4 (62%). These homologies 
are well in the range that allows efficient family shuffling (Crameri et ai (1998) Nature 391: 
288-291), 

The shuffled antigen sequences are incorporated into genetic, vaccine vectors, 
5 the plasmids purified, and subsequently injected into'mice. The sera are collected from the 
mice and analyzed for the presence of high levels of cross-reactive antibodies. The best 
antigens are selected for further studies using in vivo challenge models to screen for 
chimeras/mutants that induce cross-protection against all strains of Dengue. 

C. Improved Expression and Immunogenicity of Hantaan virus Gi ycoproteins 

10 One of the advantages of genetic vaccines is that vectors expressing pathogen 

antigens can be generated even when the given paihogen cannot be isolated in culture. .\n 
example of such potential situation was an ouibrea;: of severe respiratory disease amoDg 
rural residents of the Southwestern United States \v:;ich was caused by a previously " 
unknown hantavirus, Sin Nombre virus (Hjelle et c. (1994) J. Virol 6S: 592). Much RNA 

15 sequence intormation of the virus was obtained wei; before the virus could be isolated and- 
charactenzed viirG, In these situations, genetic \'2ccines can provide means to generate ■ 
efficient vaccmes in a shon period of time by creating vectors encoding antigens encoded by ^ 
the pathogen. However, genetic vaccines can only work if these antigens can be properly ■ 
expressed in the host. 

20 ^ Hantaan virus belongs to the Bunyavirus family. A characteristic feature of 

this family is that their glycoproteins typically accumulate at the membranes of the Golgi' 
apparatus when expressed by cloned cDNAs, thereby reducing the efficacy of corresponding 
genetic vaccines (Maisuoka et al (1991) Cwrr Top. Microb. Immunol 169: 161-179). Poor 
expression of Hantaan virus glycoproteins on the cell surface is also one explanation for 

25 poor immune responses following injections of Hantaan virus genetic vaccines. 

In this Example, family DNA shuffling is used to generate recombinant 
Hantaan virus derived glycoproteins that are efficiently expressed in human cells and that 
can induce protective immune responses against the wild-type pathogen. Nucleic acids that 
encode the Hantaan virus glycoprotein are shuffled with genes that encode other 

30 homologous Bunyavirus glycoproteins. The resulting library is screened to identify proteins 

that-arereadily-expressednn-hunian-cellsT— The-screening-is-perfoni:ied-using-a-duah — — 
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expression vector thai enables simultaneous analysis of transfection efficiency and 
expression of fusion proteins that are PIG-linked to the cell surface (Whitehom et al: (1995) 
Biotechnology (N Y) 13:1215-9). ' ■ ' ,/ 

Flow cviometn' based cell sorting, is used to select Hantaan' virus glycoprotein 
5 • variants that.are efficiently expressed in mammalian cells. The corresponding sequences are 
then obtained^by PCR or plasmid recovery. These chimeras/mutants are further analyzed for 
■ their capacity to protect wild mice against Hantaan virus infections. 

Example 7 

DNA Shuffling Of HSV-1 And HSV-2 Glycoproteins B And/Or D As iVIeans To Induce 
10 Enhanced Protective Immune Responses 

This Example descnbes the use of DNA shuffling to obtain HSV glycoprotein 
B (gB) and glycoprotein D (gD) polypeptides that exhibit improved ability to induce 
protective immune responses upon administration lo a mamma!.. Epidemiological studies 
have showTi.that pnor infections with HSV-1 give partial protection. against infections with 
15 HSV-2, indicating existence of cross-reactive immune responses. Based on previous 

vaccination studies, the main' immunogenic glycoproteins in HSV appear to be gB and gD, 
which are encoded by 2.7 kb and 1 .2 kb genes, respectively. The gB and gD genes of HSV- 
1 are about 85%, identical to the corresponding gene of HSV-2, and the gB genes of each 
share little sequence identity with the gD genes. Baboon HSV-2 gB is appr. 75% identical to 
20 human HSV-1 or, -2 gB, with rather long stretches of almost 90%o identity. In addition, 60- 
75% identity is found in portions of the genes of equine and bovine herpesviruses. ^ 

Family shuffling is employed using as substrates nucleic acids thai encode gB 
and/or gD 'from HSV-l and HSV-2. Preferably, homologous genes are obtained from HSVs 
of various strains. An alignment of gD nucleotide sequences fronl HSV-1 and Uvo strains of 
25 HSV-2 is shown in Figure 7. Antigens encoded by the shuffled nucleic acids are expressed 
and analyzed in vivo. For example, one can screen for improved induction of neutralizing 
antibodies and/or CTL responses against HSV-l/HSV-2. One can also detect protective 
immunity by challenging mice or guinea pigs with the viruses. Screening can be done using 
pools or individuals clones. 
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Example 8 

Evolution Of HIV Gpl20 Proteins For Induction Of Broad Spectrum Neutralizing Ab 

Responses 

This Example describes the use of DNA shuffling to generate immunogens 



5 that crossreact among different strains of viiaTses^liniikelhe wild-type immunogens. ^ 
Shuffling two kinds of envelope sequences can generate immunogens that induce 
neutralizing antibodies against a third strain. 

Antibody-mediated neutralization of HIV- 1 is strictly type-specific. 
Although neutrahzing activity broadens in infected individuals over time, induction of such 
10 antibodies by vaccination has been shown to be extremely difficult. .Antibody-mediated 

protection from HIV-1 infection in vivo correlates with antibody-mediated neutralization of 
virus in vitro. 

Figure S illustrates the generation of libraries of shuffled gpl20 genes. gpl20 
genes denved from HIV-iDH12 and HIV-1IIIB(NL43) are shuffled. The chimenc/mutant 

15 gp 120 genes are then analyzed for their capacity to induce antibodies that have broad 

spectrum capacity to neutralize different strains of KIV. Individual shuffled gpl20 genes are 
incorporated into genetic vaccine vectors, which are then introduced to mice by injection or 
topical' application onto the skin. These antigens can also be delivered as puritled 
recombinant proteins. The immune responses arc measured by analyzing the capacity of the 

20 mouse sera to neutralize HIV growth in vitro. Neutralization assays are performed against 
HIV-1DH12. HIV-IIIIB and HTV-I89.6. The chimeras/mutants that demonstrate broad 
spectrum neutralization are chosen for further rounds of shuffling and selection. .Additional 
studies are performed in monkeys to illustrate the capacity of the shuffled gpl20 genes to 
provide protection for subsequent infection with immunodeficiency virus. 

25 Example 9 

Antigen Shuffling of the Hepadnavirus Envelope Protein . 

The Hepatitis B virus (HBV) is one of a member of a family of viruses called 

hepadnaviruses- This Example describes the use of genomes, and individual genes from this 

family are used for DNA shuffling, which results in antigens having improved properties. 
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A, Shuffling of Hepadnavirus envelope prote in genes 

. ' The envelope protein of the HBV assembles to forrn panicles that carry 

antigenic 'Structures collectively known as the Hepatitis B surface antigen (HBsAg;'this term, 
is also used to designate the protein itself). Antibodies, to the major antigenic site, designated 
the 'V' epitope (which is found in the envelope domain called S), are capable of neutralizing 
the virus. Immunization with the HBsAg-bearing protein thus serves as a vaccine against 
viral infection. The HB V envelope also contains other antigenic sites that can protect against 
viral infection and are potentially vital components of an improved vaccine. The epitopes are 
pan of the envelope protein domains known as preS 1 /and preS2 (Figure 9). 

DNA shuffling of the. envelope gene from several members of the *. ' 
hepadnavirus family is used to obiain.more immunogenic proteins. Specifically, the genes' 
from the following' hepatitis viruses are shuffled: ' ' / 

• ■ the human HBV vinjses, subt>pe3 av'^^v and £3i?'U''2 - ' ' . ^ 

• ■ a hepatitis vinis isolated from chimpanzee ' 
' ' \ . • a hepaniis virjs isolated from gibbon 

e a hepatitis virus isolated from woodchuck , 

If desjred, genes from other genotypes, of the human virus are ayailable for • 

inclusion in the DNA shuffling reactions. Likewise, other animal hepadnaviruses are » . 

available. , , " , ^ , 

To promote the efneiency of the formation of chimeras resulting from. DNA 
shuffling, sorhe artificial, genes are made: ■ ; • \ 

' • In one cak, a synthetic gene is;made that contains the HBV envelope 

sequences, except for those codons which specify amino acids found in ' 
the chimpanzee aind gibbon genes. For those- codons/the chi^ 
■ ^gibbon sequence is used, , . . ' 

\ • In a second case, a synthetic gene is synthesized in which the preS2 gene 
sequence from the human HBV adw2 strain is fused with the woodchuck 
S region. 
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o In a third case, all the oligonucleotides required to chemically synthesize 
each of the hepadnavirus envelope genes are mixed in approximately 
equal quantities and allowed to anneal to form a library of sequences. 
After DNA shuffling of the hepadnavirus envelope genes, either or both of 
two strategies are used to obtain improved HBsAg antigens. . ' ■ _ ~ 

Strategy A: .Antigens are screened by immunizing mice using two possible 
methods, The genes are injected in the form of DNA vaccines, i.e., shuffled envelope genes 
carried by a plasmid that comprises the genetic regulatory elements required for expression 
of the envelope proteins. Alternatively, the protein is prepared ft-om the shuffled genes and 
used as the immunogen. 

The sequences that give nse to greater immunog^nicity for either the preS 
preS2- or S-bome HBV antigens are selected for a second round of shuffling (Figure 10). 
For the second round, the best cai^ididates are chosen based on their improved antigeniciiy 
and their other propenies such as higher expression level or more efrlcient secretion. 
Screening and further rounds of shuffling are continued until a maximum optimization for 
one of the antigenic regions is obtained. 

The individually optimized genes are then used as a combination vaccine for 
the induction of optimal responses to preSl-, preS2- and S-bom epitopes. 

Strategy B; After isolation of the individually optimized genes as in Strategy ■ 

A, the preSl, preS2 and S candidates are shuffled together, or in a pairwise fashion, in 
further rounds to obtain genes which encode pro teinsjhat demonstrate improved, 
immimogenicity for at least nvo regions containing KBsAg epitopes (Figure 11). 

B. Use of HBsAg to carry epitopes from unrelated antigens 

■ ■ Several of the characteristics of the HBsAg make it a useful protein lo carrv , 
epitopes drawn from other, unrelated antigens. The epitopes can be either B epitopes (which 
induce antibodies) or T epitopes drawn from the class I type (which stimulate CD8^ T 
lymphocytes and induced cytotoxic cells) or class II type (which induce helper T 
lymphocytes and are important in providing immunological memory responses. 
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'J. Bcell epitopes 
• . ' Amino acid sequences of potential B epitopes are chosen from 'any pathoge 

. • . Such sequences are often knovvTi to induce antibodies, but the immunogenicity is weak or 
. otherwise unsatisfactorv' for preparation 'of a vaccine. These sequences can also be ' 
5 mimbtopes, which have been selected based on their ability to have a certain antigenicity or ■ 
, inuhunogenicity. . , 

; The amino ,acid codmg sequences are added; to^a hepadnaviru^ envelope gene. 
The heterologous sequences can either replace certain envelope sequences, or be added in 
[ addition to all the envelope sequences. The heterologous epitope sequences can be placed at. 
'10 ' ^ any position in the envelope gene. A preferred position is the region of , the envelope. gene 
> that encodes the major ''a" epitope of the HBsAg (Figure 12). This region. is likely to be 
• , .exposed on the external side of the particles formed by the envelope protein,, and thus will 
. - * expose the heterologous epitopes,. - ■ ■ , , 

' DNA shuffling is earned out on the envelope gene sequences, keeping the 

1 5 sequence of the heterologous epitopes constant. Screening is earned out to choose candidates 
that are secreted into the culture medium after transfeciion of plasmids. from the shuffled 
library into cells in tissue culture. 
' . ' Clones that encode r secreted protein are then tested for immunogenicity in 

mice either as a DNA vaccine or as a protein-antigen, as described above. Clones that -give 
• .20 . an improved induction of antibodies to the heterologous epitopes are chosen for further 
. rounds of DNA shuffling. The process is continued ^until the immunogenicity of the 
' heterologous epitope is sufficient for use as a vaccine against the pathogen from which the 
heterologous epitopes were derived. . , ( 

2. Class I epitopes • 
25 ^ ^ N^C Class I epitopes are relatively short^linear pep^^^ 

generally between 6 and 12 amino acids amino acids in length, most often 9 amino acids in 
length. These epitopes are processed by antigen-presenting cells either after synthesis of the 
epitope within the cell (usually as part of a larger protein) or after uptake of soluble protein 
by the cells. 

30 Polynucleotide sequences that encode one or more class I epitopes are 

inserted into the sequence of a hepadnavirus envelope gene either by replacing certain 
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envelope sequencr'^s, or by inserting the epitope sequences into the envelope gene. This is 
typically done by modifying the gene before DNA shuffling or by including in the shuffling 
reaction certain ohgonucleotide fragments. that encode the heterologous epitopes as well as 
sufficient flanking hepadnavirus sequences to be incorporated into the shuffled products. , 



5 


Preferably, the heterologous class I epitopes are placed into different 
positions in the several hepadnavirus genes used for the DNA shuffling reaction. This will 
optimize the chances for finding chimeric gene carrying the epitopes in an optimal position 
for efficient presentation. 

i. Class II epitopes 


- 


10 


taken up by antigen presenting cells, rather than svTithesized within the cell Preferably, such 
epitopes are incorporated into a carrier protein such as the HBV envelope that can be 
produced in a soluble form or which can be secreted if the gene is deli vered in the form of a . ' 
DNA vaccine, ' - ^^ 




15 


Polynucleotides that. encode heterologous class II epitopes are insened into - 
regions of the hepadnavirus envelope genes that are not involved in the transmembrane - 
stnj.cture of the protein. DNA shuffling is performed lo obtain a secreted protein that also 
carries the class II epitopes. WTien injected as a protein, or when the gene is delivered as a 
DNA vaccine, the protein can be taken up by antigen presenting, cells' for processing of the 




•20 • 


class 11 epitopes. 






~ ^ ^Example 10 * 
Evolution of broad spectrum vaccines against Hepatitis C Virus. 






Antigenic heterogeneity of different strains of Hepatitis C Virus (HCV) is a 






uiajui piuuitui lu ucvcjupiiiciu uj. ciin.^icni vaccines agamsi riL- v , *-\ntiDOQies or u i Ls 




. 25 


specific for onestrain of HCV tN^Dically do not protects against other strains. Multivalent 
vaccine antigens that simultaneously protect against several strains of HCV would be of 
major importance- when developing efficient vaccines against HCV. 

The HCV envelope genes, which encode envelope proteins El. and E2, have 
been shown to induce both antibody and lymphoproHferative responses against these 




30 


antigens (Lee et al, (1998) J, Virol, 72: 8430-6), and these responses can be optimized by 






DNA shuffling. The h\T3er\^ariable region 1 (HVRl) of the envelope protein E2 of HCV is 
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. . the most vanabie antigenic fragrrient in the whole virar-genome and is pnmarily responsible 
for the large inter- and intra-indi vidua) heterogeneity of the infecting virus"(Puntoner6 et ai 
(1998) EMBOJ. 17: 3521-33). Therefore, the gene encoding E2 is a particularly usefiiF 
target for evoiution by DNA shuffling. - 

—5— — — ^ ^DNA-shuffling-of-H&V-antigens-such-as-nue-leoea^ 

EI, E2, provides a means to generate multivalent HC V vaccines that simultaneously protect 
against several strains of HCV. These antigens are shuffled using the family DNA shuffling 
approach. The starting genes wi ll be obtained from various natural isolates of HCV. In '' 
..addition, related genes from other viruses can be used to increase the number of differeni / 
10 recombinants that are generated., A library of related, chimeric variants of'HCV antigens are 
then generated and this library will be screened for induction of widely crossreactive - V" 
immune responses. The screening can be done'direciiy m vivo by injecving individual 
vanants into tesi animals, such as mice or monkeys. Hither purified recombinant protehis;or ' 
DNA vaccines encodmg the relevant genes are injecied. Typically, a booster injection is 
15 given 2-3 weeks after the ilrsi injection. Thereafter, ihe sera of the test animals are cojiected' 
and' these sera are tested for .the presence of antibodies that react againsi multiple HCV -virus 
isolates. : ■ . • ' , . ' 

Beforenhe //: vivo testing is initiated, the antigens can bepre-enriched //r v/rro 
for antigens that are recognized by polyclonal antisera derived from previously infected • 
20 patients or test animals. Alternatively, monoclonal antibodies that are specific for various, 
strains of HCV are used. The screening is performed using phage display or ELISA assays. 
For example, the antigen variants are expressed on bacteriophage M13 and the phage are 
then incubated on plates coated with antisera'derived from patients or rest animals infected 
with various HCV isolates. The phage that bind to the antibodies are ihen eluted and'fiir^her 
25 analyzed in test animals for induction of crossreactive antibodies. 

Example 11 

EvoIutioQ of chimeric allergens that. induce broad immune responses and have reduced, 

risk of inducing anaphylactic reactions 

Specific immunotherapy of allergy is performed. by injecting increasing^ 
30 amounts of the given allergens, into the patients. The therapy typically alters the.types of ~ 

allergen-specific immune responses from a dominating T helper 2 (Th2) type response to a , 
^dominating-T-helper^-(-THl-)-type-responser-However— ^ 
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increased levels of IgE antibodies specific for the allergens, the iminunotherapy of allergy 
involves a risk of IgE receptor mediated anaphylactic reactions. 

■ ' T helper (Th) cells are capable of producing a large number of different 
.cytokines, and based on their cyiokine synthesis pattern Th cells are divided, into two subsets 

5, (Paul and Seder (1 994) Ce// 76: 24 1 -251). ThI cells produce high levels of IL-2'and'7FN- 
gammaand no or minimal levels of IL-4, IL-5 and IL-13. In contrast, Th2 cells produce * 
- high levels of IL-4, IL-5 and IL-13, whereas IL-2 and IFN-gamma production is minimal or 
absent. ThI cells activate' macrophages, dendritic cells and, augment the cytolytic activity of 
' ' V GD8+ cytotoxic T lymphocytes and NK cells (Id.), whereas Th2 cells provide efficient help 
10 * for B cells arid they also mediate allergic responses, due to the capacity of Th2 cells to induce 
IgE isotype switching, and differentiation of B cells into IgE secreting cell (De Vries and 

■ Punnonen (1996) In Cytokine regulation of humoral immunity: basic and clinical aspects. 
: Eds. .Snapper, CM., John VViley & Sons, Ltd;, West Susse.x, LTv, pp. 195-215 

. This Exampie describes methods to generate chimenc allergens that can 
15 broadly .modulate allergic immune responses. This can be achieved by DNA shuffling of 
related allergen genes to generate chimeric genes.. In addition, chimeric/mutated allergens 
are less likely to be recognized by preexisting IgE antibodies of the patients. Importantly, 
allergen variants that are not recognized by IgE antibodies can be selected using patient sera 

, .^andnegative selection (Figure 13). /' 

20 ; As one example, chimeric allergen vanants of Derp2, Der f2, Tyrp2 Lep d2 , 

and Gly d2 allergens are generated. These house dust mite^allergens are very common in 
• exacerbating allergic and asthmatic symptoms, and improved means to downregulate such 
allergic immune responses are desired. House dust mites can be used as sources of the genes. 
-.The corresponding genes are shuffled using family DNA shuffling and a shuffled library is 
25 generated. Phage display is used to exclude allergens that are recognized by antibodies from 
allergic individuals. It is particularly important is to exclude vanants that are recognized by 
IgE antibodies. Phage expressing the allergen variants are incubated with pools of sera 
' derived from allergic individuals. The phage that are recognized by IgE antibodies are 

removed, and the remaining allergens are further tested in vitro and in vivo for their capacity 
30 to activate allergen-specific human T cells (Figure 14). Because immunotherapy of allergy is 
believed to function through induction of a dominating ThI response as compared to allergic 
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Th2 response, efficient T cell activation and induction of a ThI t>'pe response by allereen 
variants is used as a measure o f the efficacy of the. allergens to modulate allergic T cell 
responses. 

The optimal allergen vanants are then further tested in vivo by studying skin 

~*"5 responses~after-injections'to^the-skinT-:Vstrong^inflammator>'Tesponse-aroundnhe'injection^ 

site is an indication of efficient T cell activation, and the allergen variants that induce the 
most efficient delayed type T cell response (typically observed 24 hours after the injection) 
are the best candidates for further studies in vivo to identify allergens that effectively 
downregulate allergic immune responses. Accordingly, these allergen variants re analyzed 
10 for their capacity to inhibit allergic responses in allergic, atopic and asthmatic individuals. 
The screening of allergen variants is further illustrated in Figure 13 and Figure 14. 

Example 12 

Evolution of cancer antigens that induce efficient anti-tumor immune responses 
Several cancer cells express antigens '.hai are present at significantly hisher 
15 levels on the malignant cells than on other cells in the body. Such antigens provide excellent 
targets for preventive cancer vaccines and immunotherapy of cancer. The irnmunogenicity 
of such antigens can be improved by DNA shuffling. In addition; DNA shuffling provides 
means to improve expression levels of cancer antigens. 

This Example describes methods to generate cancer antigens that can 
20 efficiently induce anti-tumor immune responses by DNA shuffling of related cancer antigen 
genes. Libraries of shuffled melanoma-associated glycoprotein (gp 100/pmel 1 7) genes 
_ (Huang er al. (1998) J. Invest. Dermaiol. Ill: 662-7) are generated. The genes can be 
isolated from melanoma'cells obtained from various patients, who may have mutations of 
the gene, increasing the diversity in the starting genes. In addition, a gplOO gene can be 
25 isolated from other mammalian species to further increase the diversity vif starting, genes. A 
typical method for the isolation of the genes is RT-PCR. The corresponding genes are 
shuffled using single gene DNA shuffling or family DNA shuffling and a shuffled library is 
generated. 

The shuffled gp 1 00 variants, either pools or individual clones, are 
30 subsequently injected into test animals, and the immune responses are studied (Figure 15). 
The shuffled antigens are either expressed in £. coll and recombinant, purified proteins are 
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inje'cted, or the antigen genes are used as components of DNA vaccines. The immune 
response can be analyzed for example by measuring anti-gplOO antibodies, as previous 
studiesindicate that the antigen can induce specific antibody responses (Humg et ai, 
supra:). Alternatively, the test animals that can be chalienged by malignant cells expressing 
5 gplOO. .Animals that have been efficiently immunized will generate cytotoxic T cells specific 
for gp 100 and will survive the challenge, whereas in non-immunized or poorly immunized 
animals the malignant cells will efficiently grow eventually resulting in' lethal expansion of 
the cells. Furthermore, antigens that induce cytotoxic T cells that have the capacity to kill 
' ' cancer cells can be identified by measuring the capacity of T cells derived from immunized 
10 animals to kill cancer cells in vitro. Typically the cancer, cells are first labeled with 

radioactive isotopes and the release of radioactivity is an indication of tumor cell killing after 
incubation in the presence of T cells from immunizec animals. Such cytotoxicity assays are ■ 
known in the art. 

The antigens that induce highest leveis of specific ariiibodies and'or can 
15 protect against the highest number of malignant cells can be chosen ibr; additional rounds of 
shuffling and screening. Mice-are, useful test animals because large numbers of antigens can 
be studied. However, monkeys are a preferred test animal, because the MHC molecules of 
monkeys are very similar to those of humans. 

To screen for antigens that have optimal capacity to activate antigen-specific 
20 T cells, peripheral blood mononuclear, cells fi-om previously infected or^immunized humans 
individuals can^be used. This is a particularly useful method, because the MHC molecules 
that will present the antigenic peptides are human MHC molecules. Shuffled cancer antigens 
that induce cytotoxic T cells that have the capacity to kill cancer cells can be identified by 
measuring the capacity of T cells derived firom immunized animals to kill cancer cells in 
25/ vitro. Typically the.cancercells are.first labeled with radioactive isotopes and the release of 
radioactivity is an indication of tumor cell killing after incubation in the presence of T cells 
from immunized animals. Such cytotoxicity assays are known in the art. 

Example 13 

Evolution of autoantigens that induce efficient immune responses 
. 30 Autoimmune diseases are characterized by an immune response directed 

against self antigens expressed by the host. Autoimmune responses are generally mediated 
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byTHi cells that produce high levels of IL-2 and rFN-gamma. Vaccines that can direct- 
/autoantigen specific T cells towards Th2 phenotype producing increased levels of IL-4 and ' 
■ IL-5 would be beneficial. For such vaccines to work, the vaccine antigens have to be able to 
efficiently activate specific T cells. DNa shuffling can be used to generate antigens that 
5 have such propertier^ToopIimall^rinducyTH^^ 

coadminister cytokines that have been shown to enhance Th2 cell activation and 
differentiation, such as IL-4 (Racke et ai (1994) J. Exp. Med. ISO: 1961-66). 

This Example descnbes methods for generating autoantigens that can 
efficiently induce immune responses. DNA shuffling is performed on related autoantigen 
• 1 0 genes. For example, Hbraries of shuffled myelin basic proteins, or fragments thereof (Zamvil 
andSteinman {\990) Ann. Rev. Immunol, 8: 579-621); Brooke et al. (1996) Nature 379: 343- 
46) are generated. MBP is considered to be an imponant autoantigen in patients with 
multiple sclerosis (MS). The genes encoding MBP from at least bovine, mouse, rat, guinea 
pig and human have been isolated providing an excellent starting point for family shuffling. 
15 A typical method for the isolation of the genes is RT-PCR. The shuffled xMBP variants, 

either pools or individual clones, are subsequently injected into test animals, and the immune 
responses are studied. The shuffled antigens are either expressed in E, coli and recombinant, 
purified proteins are injected, or the antigen genes are used as components of DNA vaccines ' ' 
or viral vectors. The immune response can be analyzed for example by measuring anti-MBP 
20 antibodies by ELISA. Alternatively, the lymphocytes derived from immunized test animals 
are activated with MBP, and the T cell proliferation or cytokine synthesis is studies. A 
seiisjtive assays for cytokine synthesis is ELISPOT (McCutcheon ai (1997) J Immunol 
Methods 210: 149-66). Mice are useful test animals because large numbers of antigens can 
be studied. However, monkeys are a preferred test animal, because the MHC molecules of 
25 monkeys are very similar to those of humans. 

^ To screen for antigens that have optimal capacity to activate MBP specific T 

cells peripheral blood mononuclear cells from patients with MS can also be used. This is a 
particularly useful method, because the MHC molecules that will present the antigenic 
peptides are human MHC molecules. Shuffled antigens that activate MBP specific T cells 

30 can be identified by measunng the capacity of T cells derived fi-om MS patients to 

proliferate or produce cytokines upon culture in the presence of the antigen variants. Such 
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assays are known in the art. One such assay is ELISPOJ (McCutcheon et ai, supra.). , An 
indication of the efficacy of an MBP vanant to activate specific T cells is also the degree of 
skin infiammation when the antigen is injected into the skin of a patient w^ith MS. Strong 
inflammation is correlated with strong, activation of antigen-specific T ceils. Improved ■ 
. activation of MBP specific T cells, particularly in the presence of IL-4, is likely to result in 
enhanced Th2 cell responses, which are beneficial in the treatment of MS patients. 

Exampiel4 

Method of Optimiziiig the Immunogenicity of Hepatitis B Surface Antigen 

This 'Example describes methods by which the envelope protein sequence of 
the hepatitis B virus can be evolved to provide a more immunogenic surface antigen. Such a 
protein is important for vaccination of low responders and for immunotherapy of chronic 
hepatitis B. ' ' . • ' 

BackgrouQd 

Current HBV vaccines (Merck, SKB) are based on the immunogenicity of the 
viral envelope protein and contain the Major (or Small) form of the envelope protein 
produced as particles in yeast. These particles .induce antibodies to the major surface antigen 
(HBsAg) which can protect against infection when antibody levels are at least 10 milli- 
Intemaiional Units per milliliter (mU/ml). These recombinant protein preparations are not 
capable of inducing humoral immunity in chronic Carriers (some 300 million cases 
worldwide) the induction of which would be important to control virus spread. Moreover, 
certain individuals respond poorly to the vaccine (up to 30-50% of vaccinees in some 
groups) and do not develop protective levels of antibody. The inclusion of the natural 
epitope sequences contained in the Middle or Large forms of the viral envelope protein has 
been us'ed as a method to increase the immunogenicity of vaccine preparations. An . ' ' 
alternative method is to introduce new [i.e., not present in the natural virus sequence) helper 
T-cell epitopes into the HBsAg sequence using, DN A shuffling technology. 

Method 

DNA sequences of HBsAg from different subtypes of HBV {e.g., ayw and 
adr) and the related woodchuck hepatitis virus are prepared for shuffling. Comparison of the 
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^ genes encoding these proteins suggests that recombination would occur at least ten times 
within 850 base pairs when shuffling the ayw and woodchuck hepatitis virus (WHV) DNA 
sequences. Nucleotide and amino acid sequences of portions of different subtypes of HBV 
are shown in Figure 17. 
5 ^Tfie sequencyo'f Thelriain 

be retained in the protein sequence by including the coding sequences of the external **a" 
loop in the final protein preparation. Peptide analogue(s) for the ''a" epitope of HBsAg have 
been described (Neurath ei al. (1984) J, ViroL Methods 9:341-346), and the immunogenicity 
of the "a" epitope has been demonstrated (Bhatnagar ai (1982) Proc. Natl, Acad. Sci\ 
10 USA 79: 4400-4404). HBsAg and VVTIsAg share the major *'a" determinant, 'and chimps can 
be protected by both antigens (Cote ei ai. (1986)7. Virol. 60: 89,5-901). Likewise, imponanr 
CTL epitopes can be included in the protein in a denned way. 

One can also easily introduce B or T (helper or CTL) epitopes from other 
antigens into the shuffled HBsAg sequence. This may focus the immune response to ceriai.n 
15 epitopes, independent of other potentially dominant epitopes from hhe same protein. 
Furthermore, the availability of the **a" loop on the HBsAg may provide a region of the 
envelope protein into which other artificial antigens or mimotopes could be included. 

In all cases where a novel HBV envelope sequence is prepared to include a 
specific epitope (from HBV, another pathogen or a tumor cell), shuffling of the surrounding ' 
20 sequences in the HBV envelope will serve to optimize expression of the protein and help to 
ensure that the immune response is directed to the desired epitope. 

Several methods of analyzing and utilizing shuffled HBsAg sequencers are 
described below. 

A. Modulating expression levels of HBsAg 
25 Shuffled HBsAg sequences are introduced into cells in culture and the ability 

to direct expression of secreted HBsAg (measured with clinical kits for HBsAg expression) 
is evaluated. This can be used to identify shuffled HBsAg sequences which exhibit 
optimized HBsAg expression levels. Such coding sequences are particularly interesting for 
DNA vaccination.' 
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B. Circumventing loh^ responsiveness to the HBsAg 

Shuffled HBsAg sequences are-evaluatedTor their ability to induce 
immune response to the clinically relevant HBsAg epitopes. This can be done using mice^of. 
the H-2s and H-2fhaplotypes, which respond poorly ornot at all to HBsAg protein 
immunization; In these exp^eriments. one can verify that antibodies are generated to, the main 
^^a" epitope in^he S protein, and a second protective epitope in the PreS2 region (a linear ■ ■ 
sequence). , ' ' ' ' • 

The PreS2 and S coding sequences for the envelope protein (HBsAg) from 
the HE V ayw subtype (plasmid p.CAG-M-Kan; WTialen) arid the WHY (plasmid pWHVS' 
from AT.CC) are arnplified from the two plasmids by PGR and shuffled. Examples of 
suitable primers for PGR amplification are shown in Figure 1 8, The shuffled library of 
sequences is cloned into an HBsAg-expression vector and individual colonies are chosen for 
preparation 'Of plasmid DNA. The DNA is adminisiered to the,test animals and vectors . ' 
whjch induce the desired, immune response'aie icemitled and r^ 

C Presentation of natural HBsAg CTL epitopes, by evolved HBsAg proteins 
' ' This example describes methods of using the evolved' HBsAg'protein to, 

present natural HBsAg CTL epitopes. Shuffling is used to increase overall immunogenicity 
of the HBsAg protein, as discussed above. However, some of the evolved HBsAg sequences' ^ 
are-replaced with class I or class JI epitope sequences from the naturaTHBsAg, protem in': 
order to stimulateimmunoreactivity specifically.'io. these natural viral epitopes. 
Altematively, the natural viral epitopes can.be added to the evolved protein without loss of 
immunogenicity of the evolved HBsAg. 

Z), r Expression of tumor-derived CTL epitopes by evolved HBsAg proteins 

This exainpledescnbes methods of using the evolved HBsAg pr9iein is , used 
to express tumor-derived GTL epitopes. ; The overall immunogenicity of the HBsAg protein' ' 
is increased by shuffling. How.ever, some of the evolved HBsAg sequences are replaced with 
class I or class II epitope sequences from tumor cells in order to stimulate immunoreactivity 
specifically to these natural viral epitopes. Alternatively, the tumor cells epitopes can be 
added to the evolved protein without loss of immunogenicity of the evolved HBsAg.. 



SUBSTITUTE SHEET (RULE 26) 



wo 99/41383 PCT/US99/02944 

■ . ■ 1:3 

E, Expression ofmimotope sequences by the HBsAg 

■ This exampie descnbes the use of an e\'oived HBsAg protein' for expression 
ofmimotope seqiicnces. Aga:n*. ihe evoi\'ed HBsAg protem is used to increase overall 
immunogenicity of the protein. Ho\ve\'er. some of the evolved HBsAg seauences are 
5 replaced u'ith mimoiooe sequences lo stimulate inimun6reacii\'][ y specifically to the natural 
sequence which cross reacts \^'iih the mimotope. Alternan\'ei\'. the mimotope sequences can 
be; added to the evoi\'ed protein v.iihoui loss of immiunogenicit>' of the evolved HBsAe. 

Example 15 

Fusion Proteins Of The HBsAg Polypeptide and HIV gpl20 Protein 

10 This Exampie describes the preparation of fusion proteins (**chimcras") 

formed from the HBsAg poiypeDtide and the exiraceiluiar iragmcnt gpI^Q of the HI\' 
envelope protein, and their use as vaccines. V. 

Background 

v\7ien used as a \'accine. recombinant monomenc gpl20 has failed to induce 
15 antibodies that ha\'e strong neutrahzing activity with primar\' isolates of the HIV \'iras. Ilvhas 
been suggested that oiigomien: forms of the HIV envelope protein which expose cenain 
regions of the teniar>' structure \\'0'a\A be better able to elicit virus-neutralizing anlibodies.^^ 
(Pamn^/c/. (\991) ImmiwoL Leu. 57: 105-112; VanCott a/. (1997)7, ViroL 71: 4319v. 
^330; ■ . • 

20 In this Exaniple. ONA shuffling is applied to this problem, in order to obtain 

gpl20 poivpeptides which adopt conformations slightly diffcrent from those of previous 
preparations of recombinant gpl20. To allow the individual gpl20 molecules lo interact as 
oligomers, a fusion is prepared between gpl'20 sequences lon.'the N-termir/Js of the fusion) 
and HBsAg sequences (on the C-terminal of the fusion). 
25 • . The N-terminai peptide sequence of the S region of the HBsAg polypeptide is 

a transmembran.e structure which is locked into the membrane of the endoplasmic, reticulum, 
The actual N-terminus of the S region as well as the preS2 sequences are located in the 
lumenal pan of the ER. They are round on the outside of the final HBsAg particles. Bv - 
placing the gp 120 sequences on the N-termin'us of the HBsAg preS2 or S sequences, the 
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gpl 20 sequences are also located on. the outside of the panicles. The gp 120. molecules can 
thus be brought together in three-dimensional space to interact as in the' virus. . 

Since the exact conformation of the final chimera which will have the most 
appropriate immunogenicity cannot be predicted,. DNA shuffling is. employed. The ' 
5 sequences ofthe HBsAg polypeptide, which functions as a scaffold, and of gp 120 are both ' ■ 
, shuffled., Screening ofthe shuffled products can be performed -by ELIS A assay- using , . 
antibodies (polyclonal or monoclonal) which have previously been determined to have virus 
neutralizing activity. 

Method 

• 10 The sequences encoding the gp 120 fragment of the HIV envelope protein , are 

preferably prepared as a synthetic gene to include cocons which are optimal for gene^ 
expression in manrimals. The gp 120' sequence will typicallv. include a signal sequence on its 
N-terminal end. ' - ■ . ' ' , - ' 

The gp.i20 sequences are inserted into ;he preS2 region of an HBsAg- ^ 

.15 expressing plasmid. In- the preS2 region of the plasrnid pMKan and its derivatives, ah EcqRl 
site and ^n Xhol site are available. for cloning. The gpi 20 sequences can be inserted between 
these two sites, which bnngs the gp 120 closer to the start'of the S coding sequences, or into 
the i^coRI site alone, which leaves a spacer sequence of about, 50 amino acids beuveen the ■ 

■ gp 120 sequence and the start ofthe S.region ofthe HBsAg.; These two differentcloning , 
'20 ' ' strategies will give rise to chimeric molecules in which the gpl20'sequences are located at , 

different distances from the transmembrane region ofthe HBsAg sequence. This may be 

■ advantageous in allowing the gp 120 sequences to adopt conformations which are more . : 
' suitable immunogens than monomeric'gp 120. 

' . • DNA shuffling of the' entire chimeric sequence is , canned out. Fam 
'25 • shuffling is preferred; thisjnvolves the preparation of several gpl20-HBsAg fusion proteins 
in which different gpl20 and HBsAg (.or VVHV) sequences are used. An alignment of ■ 
HBsAg nucleotide sequences is shown in Figure 19. After shuffling ofthe different 
sequences, the products are cloned into an expression vector such as pMKan; Pools of clones 
from the library of shuffled products are trarisfected into cultured cells and the secretion of 
30 chimeric proteins is assayed with broadly reactive antibodies to gpl20. Positive clones can 
be further evaluated with particular antibodies that have demonstrated HIV neutralizing 
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aaivirv. :br exaj-npie the'an:i-CD4 b:nc-z coma::: recon:i:;nan: hunnan inonoclom} 
antibooy. igGlbi: (Kcssk: c: i 199"^ AIDS Res. Hun:. Re:rov:nises. !; :,3::ooS2; ^ 
Robcn eicL {199 J. V^roi 6S: -:S;:--S2Si. Candida:- clones ca;^ :hen be usee :o iniinuniz 
niice and :he annsenin: :>z^ia-ned is evai::a:ed :c: Hi\' '/■"is-nearraiizing acri'.'::y in viiro 

5 assa \ s. . \ ^ _^ 

Because t:ie -iir 1-0 n*cieciiie ;aL:pro\. ; 1 00 an^nio acids) is larger :n size ihan 
rhe monomenc HBsAg preS2-S proie-n (2S: aniino acids), i: is iikely thai not ever)- HBsAg 
monomer in an aggregated panicle will contain a gp 1 20 sequence, [niemal inirianon of 
protein synthesis can take place on the HBsAg coding sequences at the initiaior methionine 
10 that marks the beginning of the S region. Thus, the chimeric molecule (which covw^axxs, the 
. gpI20 sequenc-esi will be mixed in :he ceil v/iih rhe S region and rhe muinmenc oanicies 
should assembie ".viih an appropriaie numaer crchimen: noi>T;eptides and na::\ e HBsAg S 
monomers. Alternatively, a:: S-expressmg piasmic ca:: be mixed win- :he piasmid expressing 
the chimera, or a single piasmid which expresses the cnimera and ihe S ibnn cai:i be 
15 constructed. A diagram of the resulting panicles is inown in Figure 2U. 

Example 16 

DNA Shufning Of HSV-1 And HSV-2 Glycoproteins B And/Or D As Means To Induce 
Enhanced Protective Immune Responses 

This Exam.ple descnbes the use of DNA shuffling to obtain HSV giycoprotein-' 
20 B (gB) and glycoprotein D (gD) poivpepiides that exhibit improved ability lo induce 

protecuve immune responses upon administration to a mammal. Epidemioiogicai suidies 

have shown that prior infections yviih HSV-1 give panial protection against infections with 

HSV-2, indicating existence of cross-reactive im.mune responses. Based on previous 
vaccination studies, the main imm.unogenic glycoproteins in HSV appear lo be gB and gD, 
25 which are encoded by 2.7 kb and 1.2 kb genes, respectively. The gB and gD genes of HSV- 
1 are about 85% identical to the corresponding gene of HSV-2, and the gB genes oi~each 
share little sequence identity with the gD genes. Baboon HSV-2 gB is appr. 75% identical to 
human HSV-1 or -2 gB, with rather long stretches of almost 90% identity. In addition, 60- 
75% identity is found in portions of the genes of equine and bovine herpesviruses. 
30 Family shuffling is employed using as substrates nucleic acids that encode gB 

and/or gD from HSV-1 and HSV.2. Preferably, homologous genes are obtained from HSVs 
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of various strains. Aj\ alignment of gD nucleotide sequences from HSV-I and two strains of 
.HSV-2is,shown in Figure 7. Antigens encoded by the shuffled nucleic acids are expressed 
and analyzed in vivo. For example, one can screen for improved induction of neutralizing 
antibodies and/or CTL responses against HS V- l/HSV-2. One can also detect protective 
5 immunity by challenging mice or guineal)i'gs"\vithnhe'virusesr"Screening can-be-done-using— 
pools or individuals clones. 

Example 17 

Evolution Of HIV Gpl20 Proteins For Induction Of Broad Spectrum Neutralizing Ab 

Responses 

10 This Example describes the. use of DNA shuffling to generate immunogens 

that crossreact among different strains of viruses, unlike the wild-tvpe immunogens. 
Shuffling two kinds of envelope sequences can generate immunogens that induce 
neutralizing antibodies against a third strain. 

.Ajitibody-mediared neutralization of KIV- 1 is stnctly upe-specific, 

15 Although neutralizing activity broadens in ini'ected individuals over time, induction of such 
antibodies by vaccination has been shown to be extremely difficult. .Antibody-mediated 
protection from HIV-1 infection in vivo correlates with antibody-mediated neutralization of 
virus in vitro. 

Figure S illustrates the generation of libraries of shuffled gpl20 genes. gpl20 
20 genes derived from HIV-1DH12 and HIV-1IIIB(NL43) are shufiled. The chimeric/mutant 
gpl20 genes are then analyzed for their capacity to induce antibodies that have broad 
spectrum capacity to neutralize different strains of HIV. Individijal "shuffled gpl20 genes'are 
incorporated into genetic vaccine vectors, which are then introduced to mice by injection or 
topical application onto the skin. These antigens can also be delivered as purified . 
25 recombinant proteins. The immune responses are measured by analyzing the capacity of the 
mouse sera to neutralize HIV growth in vitro. Neutralization assays are performed against 
HIV-1DH12, HIV-IIIIB and HIV- 189.6. The chimeras/mutants that demonstrate broad 
spectrum neutralization are chosen for further rounds of shuffling and selection. Additional 
studies are performed in monkeys to illustrate the capacity of the shuffled gpl20 genes to 
30 provide protection for subsequent infection with immunodeficiency virus. 



SUBSTITUTE SHEET (RULE 26) 



wo 99/41383 , PCT/US99/02944 

It is understood that the examples and embodiments described herein are for 
illustrative purposes only and that' various modifications or changes. in light thereof will be 
suggested to persons skilled in the an and are to be included within the spirit and pur\'iew of , 
this appHcation and scope of the appended claims. All; publications, patents, and patent 
applications cited herein are hereby incorporated by reference for all purposes. 



SUBSTITUTE SHEET (RULE 26) 



wo 99/41383 



118 



PCT/US99/02944 



WHAT IS CLAIMED IS 

1 1 . recombinant multivalent antigenic polypeptide that comprises a first 

2 antigenic determinant of a first polypeptide and at least a second antigenic determinant from 
3 a second polype ptide. ^ 



1 2. The multivalent antigenic polypeptide of claim i, wherein the 

2 polypeptide comprises at least a third antigenic determinant from a third polypeptide. 

1 3. The multivalent antigenic polypeptide of claim i, wherein the first and 

2 second polypeptides are selected from the group consisting of cancer antigens, antigens 

3 associated with autoimmunity disorders, antigens associated with inflammatory conditions. 

4 antigens associated with allergic reactions, and antigens from infectious agents. 

1 4. The multivalent antigenic polypeptide of claim 3. wherein the antigens 

2 are from a virus, a parasite, or a bacteria. 

1 . 5. The multivalent antigenic polypeptide of claim 4, wherein the antigens 

2 are from a virus selected from the group consisting of a Venezuelan equine encephalitis virus 

3 or a related alphavirus, a virus of the Japanese encephalitis virus complex, a virus of the tick- 

4 borne encephalitis virus complex, a Dengue virus, a Hania virus, an HIV, a hepatitis B virus, 

5 a hepatitis C virus, and a Herpes simplex virus. 



1 6: The multivalent antigenic polypeptide of claim 5, wherein the anhgens 

2 are envelope proteins. 

1 7. The multivalent antigenic polypeptide of claim 4, wherein the antigens 

2 are from a bacteria and are selected from the group consisting of a Yersmia V, antigen, a 

3 Staphylococcus aureus enterotoxin, a Streptococcus pyogenes enterotoxin, a Vibrio cholera 

4 toxin, an enterotoxigenic Escherichia coli heat labile enterotoxin, a OspA and a OspC 

5 polypeptide from a Borrelia species, an Antigen 85 polypeptide from a Mycobacterium 

6 species, a VacA and a CagA polypeptide from Helicobacter pylori, and an MSP antigen 

-7 {rom-Plasmodium-faleiparum ^ — : , ^ 
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1 8. The multivalent antigenic polypeptide of claim 1, wherein the 

2 multivalent antigenic polypeptide-exhibits reduced affinity to IgE from a mammal compared 

3 to the first or second polypeptides. 

1 9. The multivalent antigenic poIj^Deptide of claim U wherein the first 

2 antigenic determinant and the second antigenic determinant are fi'om different serotypes of a 

3 pathogenic organism. 

1 10. The multivalent antigenic-polypeptide of claim 1 , wherein the first 

2 antigenic determinant and the second antigenic determinant are from different species of 

3 . pathogenic organism. 

1 11. The multivaient antigenic poi>peptide of claim l\ wherein :he first 

2. polypeptide and the second poiv^Deptide are allergens; 

1 12. The multivalent antigenicpoivpeptide of claim 1 !, wherein the 

2 allergens are dust mite allergens, grass pollen allergens, birch pollen allergens, ragweed. 

3' pollen allergens,, hazehpollen allergens, cockroach. allergens,. rice allergens, oHve tree.pollen. 

4 allergens, fungal-allergens, mustard allergens,. and. bee venom. ' 

1 13, The muhivalem antigenic polypeptide of claim 1 , wherein ^he first 

2 polypeptide and the second polypeptide are associated with an inflammatory or autoimmune 

3 disease. , 

1 14.' The multivalent antigenic poKpeptide of claim 13, wherein the first 

2 polypeptide and the second polypeptide are autoanti gens associated with a disease selected 

3 from the group' consisting of multiple sclerosis, scleroderma, systemic sclerosis, systemic 

4 lupus erythematosus, hepatic autoimmune disorder, skin autoimmune disorder, insulin- 

5 dependent diabetes mellitus, thyroid autoimmune disorder, and rheumatoid arthritis. 

1 15. The multivalent antigenic polypeptide of claim 1, wherein the first 

2 polypeptide and the second polypeptide are cancer antigens or sperm antigens. 
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1 16. A recombinant antigen library comprising recombinant nucleic acids 

2 that encode antigenic polypeptides, wherein the library is obtained by recombining at least 

3 first and second forms of a nucleic acid which comprises a polynucleotide sequence that 
-^4_~encodes-a-disease-associated^antigenic^polypeptide;~vv-herein4he-first-and^second^fom 

5 from each other in two or more nucleotides, to produce a library of recombinant nucleic 

6 acids. 

1. 17. The recombinant antigen , library of claim 16, wherein the first and 

2 second polypeptides are toxins. 



1 • IS. A method of obtaining a poivTiucieotide that encodes a recombinant 

2 antigen having improved ability to induce an immui-e response to a disease condition, the 

3 method comprising: 

4 (1) recombining at least first and second forms of a nucleic acid which 

5 comprises a pohoiucleotide sequence that encodes an antigenic polypeptide that is associated 

6 with the disease condition, wherein the first and second forms differ ft*om each other in two 

7 or more nucleotides, to produce a Hbran.^ of recombinant nucleic acids; and 

8 (2) screening the librar\' to identify at least one optimized recombinant . 

9 nucleic acid that encodes an optimized recombinant antigenic polypeptide that has improved 
10 ability to induce an immune response to the disease condition. 

- 1 V 19. - The method of claim 18^ wherein the -method furiher comprisesi • - 

2 . (3) recombining at least one optimized recombinant nucleic acid with a 

3 further form of the nucleic acid, which is the same or different fi-om the first and second 

4 forms, to produce a further library of recombinant nucleic acids; 

5 (4) screening the further library to identify at least one further 

6 ' optimized recombinant nucleic acid that encodes a polypeptide that has improved ability to 

7 induce an immune response to the disease condition; and 

8 (5) repealing (3) and (4), as necessary, until the fiirther optimized 

9 recombinant nucleic acid encodes a polypeptide that has improved ability to induce an 

10 immune response to the disease condition. 
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20. The method of claim 18, wherein the disease-associated polypeptides 
are selected from the group consisting of cancer antigens, antigens associated with 
autoimmunity disorders, antigens associated witH inflammatorv* conditions, antigens 
4 associated with allergic reactions, and antigens associated with infectious agents. 

1 ■ , ' 21. The method of claim 18, wherein the disease condition is an infectious 

2 disease and the first and second forms of the nucleic acid each encode an antigen of a 

3 different serotype of a pathogenic agent. 

1 22. . The method of claim 21, wherein the first and second forms of the 

2 nucleic acid are each from a different species of pathogen, 

1 23. ' The method of claim 21, wherein the screening is accomplished by: . ' 

2 introducing into a test animai either: , 

3 ' a) the librar\' of recombinant nucleic acids,, or 

4 , , b) recombinant polypeptides encoded by the library of recombinant 

5 nucleic acids; ' 

5; ' introducmg the pathogenic agent into the test animal; and 

- 7 " . ' determining whether the test animal is resistant to challenge by the 

S pathogenic agent. 

1 24. The method of claim 23, wherein the pathogenic agent introduced into 

2 - the test animal is of a different serotype than that used as a source of the first and s^econd 
' 3 / forms of the nucleic acid. 

1 . 25. The method of claim 23, wherein the librar>' is subdivided into a 

2 . plurality of pools, each of which pools is introduced into a test animal to identify those pools 

3 that include an optimized recombinant nucleic acid that encodes a polypeptide which has 



4 improved abiUty to induce an immune response to the pathogenic agent. 



1 



26. The method of claim 25, wherein the pools that include an optimized 
2 recombinant nucleic acid are further subdivided into a plurality of subpools, each of which 
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3 subpools is introduced into a test animal to identify those pools that include an optimized 

4 recombinant nucleic acid that encodes a polypeptide which has improved ability to induce 

5 immune response to the pathogenic agent. 

1 27. . The me.thod„ofxJaim,LS.,^vvh.ereinJhe„optimized.recombinani-nucleic--- 

2 acid encodes a multivalent antigenic pol\peptide and the screening is accomplished by: 

3 expressing the library of recombinant nucleic acids in a phage display 

4 expression vector such that the recombinant antigen is expressed as a fusion protein with a 

5 phage polypeptide that is displayed on a phage particle surface; 

6 contacting the phage with a first antibody that is specific for a first 

7 serotype of the pathogenic agent and selecting those phage that bind to the first antibody; 

8 contacting those phage that bind lo the first anribody with a second 

9 antibody that is specific for a second seroiy-pe of the pathogenic agent and selecting those 

10 phage that bind to the second antibody; 

1 1 wherein those phage that bind *o the firsi it.ntibody and the second 

12 antibody express a multivalent antigenic polypeptide. 

• 1 28. The method of claim 2 7, wherein the screening timber compnses 

2 contacting those phage that bind to the first and second antibodies With one or more 

3 additional antibodies, each of which is specific for an additional serotype of the pathogenic 

4 • agent, and selecting those phage that bind to the respective additional antibodies. 

1 29. The method of claim 27; wherein the phage display expression vector 

2 comprises a suppressible stop codon between the recombinant nucleic acid and the phage 

3 polypeptide, whereby expression in a host cell which comprises a corresponding suppressor 

4 tRNA results in production of the fusion protein and expression in a host cell which lacks a 
5, corresponding suppressor tRNA results in production of the recombinant, antigen not as a 

6 fusion protein. 

1 30. The method of claim 18, wherein the optimized recombinant antigen 

2 exhibits an enhanced expression level in a host cell and the screening is accomplished by- 

3 expression of each recombinant nucleic acid in the host cell and subjecting the host cells to 
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now cytometry-based cell sonirig to obtain.those host.cells that display the recombinant 
antigen on'thehost cell surface. , " 

31. The method of claim 18, wherein the improved property is selected . 

from the group consisting of: ,. , : . ■ 

. improved irnmunogenicity; . ■ . . , _ " ' 

r .' .. enhanced cross-reactivity against aifferem forms of the disease- , . 

associated antigenic pol>T3eptide; ^ ' ... 

reduced toxicity; ■ ' ■ : . 

improved'adjuvant activity m v'lVo; and . ■ : 

. improved production of the in-Lmunogenic polypeptide. 

32. the method , of claim 3,1, wherein the improved propeny is enhanced 
• cross-reactivity against differenf forms of the disease-associated poiyiDeptide and the first 

and second forms of the nucleic acid are. from a. first ar.d a second fomi.of the disease- 
associated polypeptide. 

■33.. Themethod of claim 32, wherein the first and second formsof the 

disease^associated polypeptide .are obtained from at least a first and second species of a . 
pathogenic agent and the optimized recombinant nucleic acid encodes a recombin^^^^^^ . 

polypeptide that induces a protective response against both species of the pathbgemc agem. , 

34. The method of claim 33, wherein'the recombinant polypeptide induces a 
protective response against at least one additional species of the pathogenic agent.. 

. . 35."' The.method ofclaim 33, wherein the,pathogenicagent_is a toxin. , 
36. The method ofclaim 33, wherein the pathogenic agent is a virus or a 

cell. 

■ 37; The method ofclaim 33, wherein the disease-associated polypeptide is a 
Yersinia V-antigen. 



SUBSTITUTE SHEET (RULE 26) 



wo 99/41383 



124 



PCT/US99/02944 



1 , 38. The method of claim 37, wherein the at least first and second forms of a 
2 nucleic acid are obtained from at least a first and second species of Yersinia. 

1 39. T he method_QfclaimJ.8..3Mherein-the,rgr5/>?/a-species-are^selected^from 

2 the group consisting of y. Y, enierocolitica, and y. pseudotuberculosis. 

1 40. The method of claim 33. wherein the pathogenic agent is a bacterial ■ 

2 toxin. 

1 41. The method of claim 18, wherein the disease condition is cancer and the 

2 screening step involves introducing the optimized recombinant nucleic acids into a genetic 

3 vaccine vector and testing libran* members for ability lo inliibit proliferation of cancer cells 

4 or inducing death of cancer cells. 

1 . 42. The method of claim 41. wherein the optimized recombinant nucleic 

2 acid comprises a nucleotide sequence that encodes a':umor specific antigen. 

1 43. The method of claim 41, wherein the optimized recombinant nucleic 

2 acid comprises a nucleotide sequence that encodes a moiecule which is capable of inhibiting 

3 proliferation of cancer cells. 

1 44. The method of claim 18, wherein the disease condition is an 

2 inflammatory response which has an unknown or no antigen specificity and the screening 

3 step involves one or more of the follpwing: . 

4 . a) determining the ability of the genetic vaccine vector to induce 

5 ' cytokine production by PBMC, synovial fluid cells, purified T cells, 

6 monocytes/macrophages, dendritic cells, or T cell clones; 

7 . b) determining the ability of the genetic vaccine vector lo induce T cell 

8 activation or proliferation; and 

* 9 c) determining the ability of the genetic vaccine vector to induce T cell 

.10 differentiation to ThI or Th2 cells. 
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1 ' , 45. The'method of claim 18, wherein the disease conditipni^ 

2 auioimmune response. . 

,j, . 45. The method of claim 45, wherein the optimized recombinant antigenic 

2. polypeptide shifts the immune response from a In 1 -mediated response to a TH2-mediated 
o' " response. ' ' . ' . ■ 

1 „ . .47. The method of claim IS, wherein the disease condition is an allergic < 

2 . immune response. . ' , ; - . 



1 , / . 48. /The method ofclaim 47, vvhereinthe.optimized recombinant antigenic . 

2 polypeptide .shifts the immune response from a TH2-mediated- response to a THl-mediated 

3 response. , . . ^ ^ ; 

1 ' . • 49. , The method of claim 47, whereir; ihe optimized recombinant antigenic 

2 ' ■ polypeptide induces an immune response characterised by predominant IgG and IgM 

3 expression and reduced IgE expression. \ 

1 ; " ' s'o*, ■ The method of claim 47, wherein the optimized recombiria^^ 

'2 polypeptide is not recognized by pre-existing. IgE molecules present, in sera of atopic 
3 ■ ' mammals. ■ . . 

1 51. The method ofclaim 50, wherein the optimized recombinant antigenic 

2 polypeptide retains T cell 'epitopes that are involved in modulating a T cell response. 

P ' 52. Amethodofobtaininga recombinant viral vector which has an 

2 enhanced: abiliW tolnduce an antiviVal response in- a cell, ih? method comprising the steps of: 

3 * '(1) recombining.at least first and second. forms a 

4 • compnse a viral vector, wherein the first and second foms differ from each other' in two or 

5 more nucleotides,ioproducealibrary of recombinant viral vectors; 

5 ' (2) transfeciingthelibrary of recombinant viral vectors into a 

7 population of mammalian cells; 

8 (3) staining the cells for the presence of Mx protein; and 
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9 ' (4) isolating recombinant viral vectors from cells which stain positive 

10 for Mx protein, wherein recombinant viral vectors from positive staining cells exhibit 

1 1 enhanced ability to induce an antiviral response. 

_J 53 . The method of claim 52, wherein the viral vector comprises an 

2 influenza viral genomic nucleic acid. - 
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Figure 1 
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Figure 2 
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Figure 3 
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Figure 5 
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Figure 8 A 



Expression of shuffled envelope gp 120 protein in a genetic vaccine 
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■ Figure 8B 

PGR primers for genetic vaccine for HIV 



6221 . the beginning of gpl 20 



7747, the, end of gpl20 



gpl20 



gp4i 



6025F CAAGCTTCTCTATCAA.AGCAGTAAGTAGTAC 
"773R ' CTtCCTCCTGCTCCCrVAGAACCCAA' 

6196F ATAG,\.AAGAGCAG.V\GACAGTGGCA 
^ 7746R • .VACAAAGGTCGTATTCCCACTGCTC ' 



,BssH2 



: Primers to generate 

shuffling materials ' 

- Primers to amplirs; 

shuffled matenals • -} 
■ EcoRl ' /' ' ' . • • 

Primers to clone the PGR 

BssH2-6205F nggcgcGCAGAAGACAGTGGGA.\TGAGAGTG • : . . ^ 

Ends next to ihc unique blunt site to be ibiroGuccd to , tragment mtO genetic 
the vector vaccine vector 



SUBSTITUTE SHEET (RULE 26) 



wo 99/41383 



PCT/US99/02944 



10/21 
Figure 9 



Domain Structure of Hepadnavirus Envelope Genes 
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/Figure 1 ] 

Shuffle genes 'for which the immuno'genicity 
of one antigenic domain, is improved: 
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. , Figure 12 

Transmembrane organization of the HBsAg poiyp,eptide 
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Figure 13 



Phage Display for Allergens not 
Recognized by Pre-existing IgE 
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Remove variants that 
bind to IgE derived 
from atopic patients 



^ Repeat until no 
binding is observed 
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' Figure 14 



Screening of Allergen Variants that 
Efficiently Activate Th cells from Atopics 





IFN^y 



I 




1^11-4 i 



i IL-2 



(Tcell 




PBMC or T cell •clones 
frorrr atopic patients ■ 

^ Activation with shuffled 
allergen' variants 

[ y T cell proliferation 

i ^ Cytokine 'synthesis 

; (lL-2, lL-4, IFN-y); ■ • 

^.In' vivo testing 



SUBSTITUTE SHEET (RULE 25) 



wo 99/4 1383 



PCT/US99/02944 



14/21- 
Figure 15 



Screening of Cancer Antigens that 
Efficiently Activate T cells from Patients 






^ PBMC or T cell clones 
from Cancer patients 

^ Activation with shuffled 
antigen variants 
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T ceil /CTL activity 
Gytol<ine synthesis 
(IL-2, IL-4, IFN-y) . 



In vivo testing 
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EigureJ7j_S.eq.uences„o.f_PreS2-SXoding-Regions.oLD,ifferent-HBsAg 



M38454, HPB.ADRICG Hepatitis B virus, complete genome. Shown is HBsAg coding 
region (SEQ ID NO: 10); PreS2 starts at 1811 of complete genome sequence, S starts at 
1976, end at 2656. 

ATGCAGTGGAACTCCACAACATTCCACCAAGCTCTGCTAGACCCCAGAGTGAGGGGCCTATA 
CTTTCGTGCTGGTGGCTCCAGTTGCGGAACAGTAAAGGCTGTTGCGACTACTGCCTCACCCA 
TATCGTCAATCTTCTCGAGGACTGGGGACCCTGCACGGAACATGGAGAACACAACATCAGGA 
TTCCTAGGACCCCTGCTCGTGTTACAGGGGGGGTTTTTCTTGTTGACAAGAATCCTCACAAT 
ACCACAGAGTCTACACTCGTGGTGGACTTCTCTGAATTTTGTAGGGGCAGCACCCACGTGTC 
TTGGCG.AAAATTCGCAGTCCGCAhCCTCGAATGACTCACCAACCTCTTGTCGTCCAA.TTTGT 
CCTGGTTATCGTTGGATGTGTCTGCGGCGTTTTATCATATTGCTCTTCATCCTGCTGCTATG 
CCTGATCTTCTTGTTGGTTCTTCTGGACTACC.ViGGTATGTTGTrTGTTTGTCCTCTAGTTC 
CAAGAAGATCAAGTACCAGCAGGGGACGATGCAAGACCTGGACGA7TCCTGGTG.AAGGAACC 
TCTATGTTTCGCTCTTGTTGGTGTACAAAACCTTCGGAGGGAAACTGCAGTTGTATTGGGAT i 
CGGATCATCTTGGGCTTTCGCAAGATTGCTATGGGAGTGGGGCTCAGr::CGTTTGTCCTGGC 
TCAGTTTAGTAGTGGGATTTGTTGAGTGGTT2GTAGGGGTTTCGCCCACTGTTTGGGTTTCA 
GTTATATGGATGATGTGGTATTGGGGGCCAAGTGTGTAGAACATCTTGAGTCGGTTTTTAGC 
TCTATTAGGAATTTTGTTTTGTCTTTGGGTATACATTTGA 

HBsAg Protein adr (SEQ ID NO: 1 1): 

MQWNSTTFHQALLD?RVRGLYF?AGGSSSGTWPV?TTAS?IS'SI?SRTGD?APmENTT3G 
FLGPLLVLQAG?FLLTRILTI?Q5LHSWWTSLNFLGAAPTCLGQNSQS?TSNHS?TSCP?IC 
PGYRWMCLRRFIIFLFILLLCLIFLLVLLDYQGMLSVGPLLPRTSTTSTGPCKTGTIPAQGT 
SMFPSSCCTKPSDGNCTCIFIPS5WAFARFLWEWASVRFSWLSLLVPPVQWFVGLSPTWLS 
VIWMMWYWGPSLYMILSPFLPLLPIFFCLW^IZ ■ 
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J02203; HPBAYW Hepatitis B Virus a>"»v. (SEQ ID NO: 1 1) 

ATGCAGTGG^WVTTCCACAACCTTCCACCAAACTCTGCAAGATCCCAGAGTGAGAGGCCTGTA 
TTTGCCTGCTCG-GGCTCCAGTTCAGGA^CAGMXCCTGTTCTGACTACTGCCTCTCCCT 
TATCGTCAATCT'--CTCGAGGATTGGGGACCCTGCGCTG^J^CATGGAGAACATCACATCAGGA 
TTCCTAGGACCCCTTCTCGTGTTACAGGCGGGGTTTTTGTTGTTGACAAGAATCCTCACAAT 
laaaCAGAGTCTAaACTGGTGGTGGACTTCTCTCAATTTTCTAG^ 

TTGGCCAAAATTCGCAGTCCCCAACCTCCAATCACTCACCAACCTCTTGTCCTCCAAC - ToT 
CCTGGTTATCGCTGGATGTGTCTGCGGCGTTTTATCATCTTCCTCTTCATCCTGCTGCTATG 
rOTrATCTTCTTGTTGGTTCTTCTGGACTATCAAGGTATGTTGCCCGTTTGTCCTCTAATTC 
CAGGATCCTCAACAACCAGCACGGGACCATGCCGGACCTGCATGACTACTGCTCAAGGAACC 
TCTATGTATCCCTCCTGTTGCTGTACCAAACCTTCGGACGGAAATTGCACCTGTATTCCCAT 
rrrATCATCCTGGGCT-^TCGGAAAATTCCTATGGGAGTGGGCCTCAGCCGGTTTCTeCTGGC 
TCAGTTTACTAGTGCCATTTGTTCAGTGGTTCGTAGGGCTTTCCCCCACTGTTTGGCTTTCA 
GTTATATGGATGATGTGGTATTGGGGGeCAAGTGTGTACAGCATCTTGAGTCGGTTTTTACC 
GCTGTTACCAATTTTCTtTTGTCTTTGGGTATACATTTAA , 

Forward HBV aV>v PGR Primer (with 2 natural EcoRI sites) (SEQ ID NO: 12): 
5' -CCGGGAATfcCTCGACACCATGCAGTGGAATTCCACAACC-3 '■ , ' 

Reverse HBV avyv PGR Primer (with added Kpnl site2(SEQ ID NO: 13): 

5 ' -CCGGGGTACGCAAAGACAAAAGAAAATTGGT;^--vCAGCGG-3 ' . . 

FLGPLLVLQAGF^LLTRILTIPQSLDSWWTSLJ^FLGGTTVCLGQN^ 

S?R5^SRRFliFLFVLLLCLr?LLVLLDYQGML?VC?LIPGSSTTSTGPCRTCMTTAQG^ 

sm??^StSsdgnctcipi?sswafgkflwewasarfswlsllvpfvqwfvgls?™s 
viwmmvjywgpsly.sil3pflpllpiffclwvyiz ■. ' 
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J04514 OHVHEPBA Woodchuck hepatitis B virus genome.(WHV'8) (SEQ ID NO: 15) 
(PNAS 86, 1846-1849, 1989). ATCC No. 45097. EcoRI site at pes. 1 cuts the PreSl 
region. HBs(S) ORF: 296-964. PreS2 starts at 116; PreSl starts at 3039. 

ATGA7VAAATCAGACTTTTCATCTCCAGGGGTTCGTAGACGGATTACGAGACTTGACAACAAC 

GGAACGCCAACACAATGCCTATGGAGATCCTTTTACAACACTAAGCCCTGCGGTTCCTACTG 

'TATCCAi:CA"TATTGT'CTCCTCCCTCGArGA'CTGGGGA"CCCTGCArTGTCA'CCGGAG^ 

CCATCAAGTCTCCTAGGACTCCTCGCAGGATTACAGGTGGTGTATTTCTTGTGGACAAAAAT 

CCTAACAATAGCTCAGAATCTAGATTGGTGGTGGACTTCTCTCAGTTTTCCAGGGGGCATAC 

CAGAGTGCACTGGCCAAAATTCGCAGTTCCAAACTTGCAAACACTTGCCAACCTCCTGTCCA 

CCAACTTGCAATGGCTTTCGTTGGATGTATCTGCGGCGTTTTATCATATACCTATTAGTCCT 

GCTGCTGTGCCTCATCTTCTTGTTGGTTCTCCTGGACTGGAAAGGTTTAATACCTGTCTGTC 

CTCTTCAACCCACAACAGAAACAACAGTCAATTGCAGACAATGCACAATCTCTGCACAAGAC 

ATGTATACTCCTCCTTACTGTTGTTGTTTAAAACCTACGGCAGGAAATTGCAGTTGTTGGCC 

CATCCCTTCATCATGGGCTTTAGGAAATTACCTATGGGAGTGGGCCTTAGCCCGTTTCTCTT 

GGCTCAATTTACTAGTGCCCTTGCTTCAATGGTTAGGAGGAATTTCCCTCATTGCGTGGTTT 

TTGCTTATATGGATGATTTGGTTTTGGGGGCCCGCACTTCTGAGCATCTTACCGCCATTTAT 

TCCCATATTTGTTCTGTTTTTCTTGATTTGGGTATACATTTGA 

Fonvard WHY PCR Primer (with added EcoRI site) (SEQ ID NO; 16): 
5 ' - CCGGGAATTCTCATCTCCAGGGGTTCGTAGACGGATTACG - 3 ' 



Reverse WHV PCR Primer (with added Kpnl site) (SEQ ID NO: 1 7); 
- ccggggtacccaaatcaagaaaaacagaaca;^tatggg- 3 ' 

WHV Surface Antigen Protein (SEQ ID NO: 18): 

MKNQTFHLQGFVDGLRDLTTTr.RQ:-iNAYGDPFTTLS?AVPTVSTIL5??STTGD?AI^ 

PSSLLGLI^GLQVVYFLWTKIL-TIAQNLDWWWTSLSrPGGIPECTGQNSQFQTCKHL^ 

PTCNGFRWMYLRRFIIYLLVLLLCLIFLLVLLDWKGLIPVCPLQPTTETTVNCRQCTISAQD 

MYT?PYCCCLKPTAGNCTCWPI?SSWALG^IYLWEWAIJU^FSV?LNLLV?L■LQV)LGGISLIAWF 

LLIWMIWFWGPALLSILPPFIPIFVLFFLIWVYIZ 
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Primers for PCR amplification of the , large iragments containing the S2S coding • 
sequences (primers are about 200 bp outside the desired sequences) 

AYWSHFOR( SEQ ID NO: 21).' _ '. ' ' ' 

5' -GCGGGCAGGAAGGAAATGGG-3' (ISGC - , /AT ■= Ttc: 66) . . 

AY,WSHREV(SEQIDNO;-22) ,. 

5 ' -CTGGTATTGTCTTCeCAATCCTC-3 ' (ilGC -r 12AT = Tm 68) 

WHVSHFOR fSEQ ID NO: 23) / ,. ' ■ 

S' -CGGGACATACCACGTGGtTTAG-3" ;i2GC r lOAT = Tm 68r, ■ 

WHVSHREV (SEQ ID NO: 24). , • _ 

5 ' -GGCATTAAAGCAGGGTATCCAC-3 ; . iLlGC ,r 1 lAT =■ -.it, o5;;- 
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